Harnessing Large Language Models for Seed Generation in Greybox Fuzzing
Wenxuan Shi, Yunhang Zhang, Xinyu Xing, Jun Xu

TL;DR
This paper presents SeedMind, a system leveraging Large Language Models to generate high-quality seeds for greybox fuzzing, improving bug discovery efficiency and coverage through an iterative, feedback-driven approach.
Contribution
Introduces SeedMind, a novel LLM-based seed generation system for greybox fuzzing that addresses input format and context limitations with an iterative refinement process.
Findings
SeedMind generates test cases comparable to human-created seeds.
It significantly outperforms existing LLM-based seed generation methods.
The system enhances bug detection and code coverage in real-world applications.
Abstract
Greybox fuzzing has emerged as a preferred technique for discovering software bugs, striking a balance between efficiency and depth of exploration. While research has focused on improving fuzzing techniques, the importance of high-quality initial seeds remains critical yet often overlooked. Existing methods for seed generation are limited, especially for programs with non-standard or custom input formats. Large Language Models (LLMs) has revolutionized numerous domains, showcasing unprecedented capabilities in understanding and generating complex patterns across various fields of knowledge. This paper introduces SeedMind, a novel system that leverages LLMs to boost greybox fuzzing through intelligent seed generation. Unlike previous approaches, SeedMind employs LLMs to create test case generators rather than directly producing test cases. Our approach implements an iterative,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Service-Oriented Architecture and Web Services · Web Applications and Data Management
