LibLMFuzz: LLM-Augmented Fuzz Target Generation for Black-box Libraries
Ian Hardgrove, John D. Hastings

TL;DR
LibLMFuzz leverages large language models to autonomously generate fuzzing drivers for closed-source libraries, achieving full API coverage and high correctness without human intervention, thus reducing fuzzing costs.
Contribution
This paper introduces LibLMFuzz, a novel LLM-based framework that automates fuzz target generation for black-box libraries, significantly reducing manual effort and increasing coverage.
Findings
Achieved 100% API coverage on four Linux libraries.
Generated 558 syntactically correct fuzzing drivers with no human input.
75.52% of drivers were correct on first execution.
Abstract
A fundamental problem in cybersecurity and computer science is determining whether a program is free of bugs and vulnerabilities. Fuzzing, a popular approach to discovering vulnerabilities in programs, has several advantages over alternative strategies, although it has investment costs in the form of initial setup and continuous maintenance. The choice of fuzzing is further complicated when only a binary library is available, such as the case of closed-source and proprietary software. In response, we introduce LibLMFuzz, a framework that reduces costs associated with fuzzing closed-source libraries by pairing an agentic Large Language Model (LLM) with a lightweight tool-chain (disassembler/compiler/fuzzer) to autonomously analyze stripped binaries, plan fuzz strategies, generate drivers, and iteratively self-repair build or runtime errors. Tested on four widely-used Linux libraries,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
