HarnessAgent: Scaling Automatic Fuzzing Harness Construction with Tool-Augmented LLM Pipelines

Kang Yang; Yunhang Zhang; Zichuan Li; Guanhong Tao; Jun Xu; Xiaojing Liao

arXiv:2512.03420·cs.CR·December 12, 2025

HarnessAgent: Scaling Automatic Fuzzing Harness Construction with Tool-Augmented LLM Pipelines

Kang Yang, Yunhang Zhang, Zichuan Li, Guanhong Tao, Jun Xu, Xiaojing Liao

PDF

Open Access

TL;DR

HarnessAgent is a tool-augmented framework that significantly improves the scalability and reliability of automatic harness construction for program fuzzing, especially for internal functions, by addressing context, validation, and retrieval challenges.

Contribution

It introduces a novel framework with rule-based error minimization, a hybrid tool pool for source code retrieval, and an enhanced validation pipeline, enabling scalable and accurate harness generation across large projects.

Findings

01

Increases three-shot success rate by ~20% over state-of-the-art methods.

02

Achieves over 75% coverage increase in fuzzing targets.

03

Attains over 90% source code retrieval response rate, outperforming baselines.

Abstract

Large language model (LLM)-based techniques have achieved notable progress in generating harnesses for program fuzzing. However, applying them to arbitrary functions (especially internal functions) \textit{at scale} remains challenging due to the requirement of sophisticated contextual information, such as specification, dependencies, and usage examples. State-of-the-art methods heavily rely on static or incomplete context provisioning, causing failure of generating functional harnesses. Furthermore, LLMs tend to exploit harness validation metrics, producing plausible yet logically useless code. % Therefore, harness generation across large and diverse projects continues to face challenges in reliable compilation, robust code retrieval, and comprehensive validation. To address these challenges, we present HarnessAgent, a tool-augmented agentic framework that achieves fully automated,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Advanced Malware Detection Techniques