Discovering 100+ Compiler Defects in 72 Hours via LLM-Driven Semantic Logic Recomposition
Xingbang He, Yuanwei Chen, Hao Wu, Jikang Zhang, Zicheng Wang, Ligeng Chen, Junjie Peng, Haiyang Wei, Yi Qian, Tiantai Zhang, Linzhang Wang, Bing Mao

TL;DR
This paper introduces FeatureFuzz, a semantic-aware compiler fuzzer that leverages features derived from bug reports to generate diverse, bug-prone programs, leading to the discovery of over 100 compiler defects in 72 hours.
Contribution
FeatureFuzz is a novel semantic feature-based fuzzing approach that improves bug discovery in compilers by explicitly reusing bug-triggering semantics from historical reports.
Findings
Discovered 167 unique crashes in 24 hours, outperforming other fuzzers.
Identified 113 bugs in GCC and LLVM within 72 hours, with 97 confirmed by developers.
Enhanced bug detection by preserving semantic invariants in program generation.
Abstract
Compilers constitute the foundational root-of-trust in software supply chains; however, their immense complexity inevitably conceals critical defects. Recent research has attempted to leverage historical bugs to design new mutation operators or fine-tune models to increase program diversity for compiler fuzzing.We observe, however, that bugs manifest primarily based on the semantics of input programs rather than their syntax. Unfortunately, current approaches, whether relying on syntactic mutation or general Large Language Model (LLM) fine-tuning, struggle to preserve the specific semantics found in the logic of bug-triggering programs. Consequently, these critical semantic triggers are often lost, resulting in a limitation of the diversity of generated programs. To explicitly reuse such semantics, we propose FeatureFuzz, a compiler fuzzer that combines features to generate programs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Advanced Malware Detection Techniques
