Latent Idiom Recognition for a Minimalist Functional Array Language using Equality Saturation

Jonathan Van der Cruysse, Christophe Dubach

Published in Proceedings of the 22nd ACM/IEEE International Symposium on Code Generation and Optimization, 2024

Accelerating programs is typically done by recognizing code idioms matching high-performance libraries or hardware interfaces. However, recognizing such idioms automatically is challenging. The idiom recognition machinery is difficult to write and requires expert knowledge. In addition, slight variations in the input program might hide the idiom and defeat the recognizer.

This paper advocates for the use of a minimalist functional array language supporting a small, but expressive, set of operators. The minimalist design leads to a tiny sets of rewrite rules, which encode the language semantics. Crucially, the same minimalist language is also used to encode idioms. This removes the need for hand-crafted analysis passes, or for having to learn a complex domain-specific language to define the idioms.

Coupled with equality saturation, this approach is able to match the core functions from the BLAS and PyTorch libraries on a set of computational kernels. Compared to reference C kernel implementations, the approach produces a geometric mean speedup of 1.46x for C programs using BLAS, when generating such programs from the high-level minimalist language.

Download preprint here. An artifact is available on Zenodo.

BibTeX citation:

@inproceedings{vandercruysse2024liar,
    author = {Van der Cruysse, Jonathan and Dubach, Christophe},
    title = {Latent Idiom Recognition for a Minimalist Functional Array Language using Equality Saturation},
    year = {2024},
    url = {https://jonathanvdc.github.io/files/2024-cgo-latent-idiom-recognition.pdf},
    abstract = {Accelerating programs is typically done by recognizing code idioms matching high-performance libraries or hardware interfaces. However, recognizing such idioms automatically is challenging. The idiom recognition machinery is difficult to write and requires expert knowledge. In addition, slight variations in the input program might hide the idiom and defeat the recognizer.
    
    This paper advocates for the use of a minimalist functional array language supporting a small, but expressive, set of operators. The minimalist design leads to a tiny sets of rewrite rules, which encode the language semantics. Crucially, the same minimalist language is also used to encode idioms. This removes the need for hand-crafted analysis passes, or for having to learn a complex domain-specific language to define the idioms.
    
    Coupled with equality saturation, this approach is able to match the core functions from the BLAS and PyTorch libraries on a set of computational kernels. Compared to reference C kernel implementations, the approach produces a geometric mean speedup of 1.46x for C programs using BLAS, when generating such programs from the high-level minimalist language.},
    booktitle = {Proceedings of the 22nd ACM/IEEE International Symposium on Code Generation and Optimization},
    numpages = {9},
    keywords = {equality saturation, functional programming, array programming, pattern matching, libraries},
    location = {Edinburgh, United Kingdom},
    series = {CGO '24}
}