Agentic Fuzzing: Opportunities and Challenges

Junyoung Park; Insu Yun

arXiv:2605.10074·cs.CR·May 12, 2026

Agentic Fuzzing: Opportunities and Challenges

Junyoung Park, Insu Yun

PDF

TL;DR

Agentic fuzzing leverages deep reasoning agents to identify complex logic bugs in codebases by analyzing root causes, hypothesizing scenarios, and verifying through generated code, showing promising results in JavaScript engines.

Contribution

This paper introduces agentic fuzzing, a novel approach using deep agents for reasoning to find logic bugs, addressing challenges like harness engineering and seed scheduling.

Findings

01

Found 40 bugs in V8 JavaScript engine within one month

02

Discovered 19 bugs in SpiderMonkey and JavaScriptCore using V8 seeds

03

Generated $35,000 in bounties and received two CVEs

Abstract

Fuzzers and static analyzers find many bugs but struggle with logic bugs in mature codebases. Triggering such a bug often requires multi-step reasoning that produces no distinctive execution feedback, and variants can appear across implementations too different for a single pattern to match. Recent LLM-assisted approaches help, but they use LLMs as auxiliaries rather than as the reasoning engine. We propose agentic fuzzing, a bug-finding approach seeded by historical bugs in which deep agents perform the reasoning directly. Given a reference bug, the agent analyzes its root cause, hypothesizes new scenarios elsewhere in the codebase that may share that cause, and verifies each hypothesis by generating and running proof-of-concept code. This lets the agent find variants that differ completely in trigger path or code structure from the reference. We identify three practical challenges…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.