QiMeng-NeuComBack: Self-Evolving Translation from IR to Assembly Code
Hainan Fang, Yuanbo Wen, Jun Bi, Yihan Wang, Tonghui He, Yanlin Tang, Di Huang, Jiaming Guo, Rui Zhang, Qi Guo, Yunji Chen

TL;DR
This paper introduces NeuComBack, a benchmark dataset for IR-to-assembly compilation, together with a self-evolving prompt optimization method that significantly improves the correctness and performance of LLM-generated assembly code, advancing neural compilation.
Contribution
It presents a dedicated IR-to-assembly benchmark, evaluates LLMs' neural compilation capabilities, and proposes a novel self-evolving prompt technique to enhance code correctness and efficiency.
Findings
Functional correctness improved from 44% to 64% on x86_64.
Functional correctness improved from 36% to 58% on aarch64.
Most correctly generated x86_64 programs outperformed clang-O3.
Abstract
Compilers, while essential, are notoriously complex systems that demand prohibitively expensive human expertise to develop and maintain. The recent advancements in Large Language Models (LLMs) offer a compelling new paradigm: Neural Compilation, which could potentially simplify compiler development for new architectures and facilitate the discovery of innovative optimization techniques. However, several critical obstacles impede its practical adoption. Firstly, a significant lack of dedicated benchmarks and robust evaluation methodologies hinders objective assessment and tracking of progress in the field. Secondly, systematically enhancing the reliability and performance of LLM-generated assembly remains a critical challenge. Addressing these challenges, this paper introduces NeuComBack, a novel benchmark dataset specifically designed for IR-to-assembly compilation. Leveraging this…
Taxonomy
Topics: Parallel Computing and Optimization Techniques · Software Engineering Research · Embedded Systems Design Techniques
