LEGO-Compiler: Enhancing Neural Compilation Through Translation Composability

Shuoming Zhang; Jiacheng Zhao; Chunwei Xia; Zheng Wang; Yunji Chen; Xiaobing Feng; Huimin Cui

arXiv:2505.20356·cs.PL·May 28, 2025

LEGO-Compiler: Enhancing Neural Compilation Through Translation Composability

Shuoming Zhang, Jiacheng Zhao, Chunwei Xia, Zheng Wang, Yunji Chen, Xiaobing Feng, Huimin Cui

PDF

Open Access

TL;DR

LEGO-Compiler is a neural compilation system that uses LLMs to translate high-level code into assembly, featuring decomposition, verifiable steps, and self-correction, achieving high accuracy and scalability.

Contribution

It introduces a novel LLM-based compilation approach with formal proofs of composability, improving accuracy and scalability over existing methods.

Findings

01

Over 99% accuracy on ExeBench

02

97.9% accuracy on AnsiBench

03

Near tenfold improvement in code scalability

Abstract

Large language models (LLMs) have the potential to revolutionize how we design and implement compilers and code translation tools. However, existing LLMs struggle to handle long and complex programs. We introduce LEGO-Compiler, a novel neural compilation system that leverages LLMs to translate high-level languages into assembly code. Our approach centers on three key innovations: LEGO translation, which decomposes the input program into manageable blocks; breaking down the complex compilation process into smaller, simpler verifiable steps by organizing it as a verifiable LLM workflow by external tests; and a feedback mechanism for self-correction. Supported by formal proofs of translation composability, LEGO-Compiler demonstrates high accuracy on multiple datasets, including over 99% on ExeBench and 97.9% on industrial-grade AnsiBench. Additionally, LEGO-Compiler has also acheived near…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications