HarnessAgent: Scaling Automatic Fuzzing Harness Construction with Tool-Augmented LLM Pipelines
Kang Yang, Yunhang Zhang, Zichuan Li, Guanhong Tao, Jun Xu, Xiaojing Liao

TL;DR
HarnessAgent is a tool-augmented framework that significantly improves the scalability and reliability of automatic harness construction for program fuzzing, especially for internal functions, by addressing context, validation, and retrieval challenges.
Contribution
It introduces a novel framework with rule-based error minimization, a hybrid tool pool for source code retrieval, and an enhanced validation pipeline, enabling scalable and accurate harness generation across large projects.
Findings
Increases three-shot success rate by ~20% over state-of-the-art methods.
Achieves over 75% coverage increase in fuzzing targets.
Attains over 90% source code retrieval response rate, outperforming baselines.
Abstract
Large language model (LLM)-based techniques have achieved notable progress in generating harnesses for program fuzzing. However, applying them to arbitrary functions (especially internal functions) \textit{at scale} remains challenging due to the requirement of sophisticated contextual information, such as specification, dependencies, and usage examples. State-of-the-art methods heavily rely on static or incomplete context provisioning, causing failure of generating functional harnesses. Furthermore, LLMs tend to exploit harness validation metrics, producing plausible yet logically useless code. % Therefore, harness generation across large and diverse projects continues to face challenges in reliable compilation, robust code retrieval, and comprehensive validation. To address these challenges, we present HarnessAgent, a tool-augmented agentic framework that achieves fully automated,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Advanced Malware Detection Techniques
