Hybrid Fuzzing with LLM-Guided Input Mutation and Semantic Feedback
Shiyin Lin

TL;DR
This paper introduces a hybrid fuzzing framework that combines static and dynamic analysis with Large Language Model-guided input mutation and semantic feedback, leading to faster and more diverse vulnerability discovery.
Contribution
It presents a novel approach integrating LLMs with semantic feedback in fuzzing, improving efficiency and depth of vulnerability detection over existing methods.
Findings
Faster time-to-first-bug compared to state-of-the-art fuzzers
Higher semantic diversity in generated inputs
Competitive number of unique bugs discovered
Abstract
Software fuzzing has become a cornerstone in automated vulnerability discovery, yet existing mutation strategies often lack semantic awareness, leading to redundant test cases and slow exploration of deep program states. In this work, I present a hybrid fuzzing framework that integrates static and dynamic analysis with Large Language Model (LLM)-guided input mutation and semantic feedback. Static analysis extracts control-flow and data-flow information, which is transformed into structured prompts for the LLM to generate syntactically valid and semantically diverse inputs. During execution, I augment traditional coverage-based feedback with semantic feedback signals-derived from program state changes, exception types, and output semantics-allowing the fuzzer to prioritize inputs that trigger novel program behaviors beyond mere code coverage. I implement our approach atop AFL++,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Advanced Malware Detection Techniques
