ELFuzz: Efficient Input Generation via LLM-driven Synthesis Over Fuzzer Space
Chuyang Chen, Brendan Dolan-Gavitt, Zhiqiang Lin

TL;DR
ELFuzz is an automated, LLM-driven approach that synthesizes efficient fuzzers tailored to large real-world systems, significantly improving code coverage and bug detection over existing methods.
Contribution
ELFuzz introduces a novel automated synthesis method for fuzzers using LLMs, capable of scaling to large systems and capturing complex input structures.
Findings
Achieves up to 434.8% more coverage than state-of-the-art methods.
Triggers up to 216.7% more artificially injected bugs.
Found five zero-day bugs in a real-world fuzzing campaign.
Abstract
Generation-based fuzzing produces appropriate test cases according to specifications of input grammars and semantic constraints to test systems and software. However, these specifications require significant manual effort to construct. This paper proposes a new approach, ELFuzz (Evolution Through Large Language Models for Fuzzing), that automatically synthesizes generation-based fuzzers tailored to a system under test (SUT) via LLM-driven synthesis over fuzzer space. At a high level, it starts with minimal seed fuzzers and propels the synthesis by fully automated LLM-driven evolution with coverage guidance. Compared to previous approaches, ELFuzz can 1) seamlessly scale to SUTs of real-world sizes -- up to 1,791,104 lines of code in our evaluation -- and 2) synthesize efficient fuzzers that catch interesting grammatical structures and semantic constraints in a human-understandable way.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software Engineering Techniques and Practices
