Beyond Imprecise Distance Metrics: Trace-Guided Directed Greybox Fuzzing via LLM-Predicted Call Stacks
Yifan Zhang, Xin Zhang

TL;DR
This paper introduces TDGF, a novel fuzzing approach that uses LLM-predicted call stacks and execution traces to more accurately target vulnerabilities, significantly improving bug detection efficiency.
Contribution
The paper proposes trace-guided directed greybox fuzzing (TDGF) that replaces static analysis with execution trace guidance and leverages LLMs for call stack prediction, enhancing vulnerability detection.
Findings
TDGF triggers vulnerabilities 2.13× to 3.14× faster than baselines.
The call-stack trace guidance is easier for LLMs to predict than control-flow traces.
The approach discovered 10 new vulnerabilities and 2 incomplete fixes.
Abstract
Directed greybox fuzzing (DGF) aims to efficiently trigger bugs at specific target locations by prioritizing seeds whose execution paths are more likely to reach the targets. However, existing DGF approaches suffer from imprecise potential estimation due to their reliance on static-analysis-based distance metrics. The over-approximation inherent in static analysis causes many seeds with execution paths irrelevant to vulnerability triggering to be mistakenly prioritized, significantly reducing fuzzing efficiency. To address this issue, we propose trace-guided directed greybox fuzzing (TDGF). TDGF replaces static-analysis-based distance metrics with vulnerability-oriented execution information (referred to as guidance traces) to steer directed fuzzing: seeds whose execution paths overlap more with the guidance traces are scheduled earlier for mutation. We empirically study two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
