Agentic Fuzzing: Opportunities and Challenges
Junyoung Park, Insu Yun

TL;DR
Agentic fuzzing leverages deep reasoning agents to identify complex logic bugs in codebases by analyzing root causes, hypothesizing scenarios, and verifying through generated code, showing promising results in JavaScript engines.
Contribution
This paper introduces agentic fuzzing, a novel approach using deep agents for reasoning to find logic bugs, addressing challenges like harness engineering and seed scheduling.
Findings
Found 40 bugs in V8 JavaScript engine within one month
Discovered 19 bugs in SpiderMonkey and JavaScriptCore using V8 seeds
Generated $35,000 in bounties and received two CVEs
Abstract
Fuzzers and static analyzers find many bugs but struggle with logic bugs in mature codebases. Triggering such a bug often requires multi-step reasoning that produces no distinctive execution feedback, and variants can appear across implementations too different for a single pattern to match. Recent LLM-assisted approaches help, but they use LLMs as auxiliaries rather than as the reasoning engine. We propose agentic fuzzing, a bug-finding approach seeded by historical bugs in which deep agents perform the reasoning directly. Given a reference bug, the agent analyzes its root cause, hypothesizes new scenarios elsewhere in the codebase that may share that cause, and verifies each hypothesis by generating and running proof-of-concept code. This lets the agent find variants that differ completely in trigger path or code structure from the reference. We identify three practical challenges…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
