ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution
Liu Yang, Zeyu Nie, Andrew Liu, Felix Zou, Deniz Altinb\"uken, Amir Yazdanbakhsh, Quanquan C. Liu

TL;DR
ParEVO is a framework that synthesizes high-performance parallel algorithms for irregular data structures, leveraging curated datasets, fine-tuned models, and evolutionary repair techniques to significantly outperform existing models and match expert human performance.
Contribution
The paper introduces ParEVO, combining a large curated task dataset, specialized fine-tuned models, and an evolutionary code repair agent to improve parallel algorithm synthesis for irregular data.
Findings
Achieves up to 1103x speedup on benchmark tasks.
Outperforms state-of-the-art commercial models on complex irregular graph problems.
Matches expert human performance with up to 4.1x speedup on irregular kernels.
Abstract
The transition from sequential to parallel computing is essential for modern high-performance applications but is hindered by the steep learning curve of concurrent programming. This challenge is magnified for irregular data structures (such as sparse graphs, unbalanced trees, and non-uniform meshes) where static scheduling fails and data dependencies are unpredictable. Current Large Language Models (LLMs) often fail catastrophically on these tasks, generating code plagued by subtle race conditions, deadlocks, and sub-optimal scaling. We bridge this gap with ParEVO, a framework designed to synthesize high-performance parallel algorithms for irregular data. Our contributions include: (1) The Parlay-Instruct Corpus, a curated dataset of 13,820 tasks synthesized via a "Critic-Refine" pipeline that explicitly filters for empirically performant algorithms that effectively utilize Work-Span…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Big Data and Digital Economy · Graph Theory and Algorithms
