SmartSeed: Smart Seed Generation for Efficient Fuzzing
Chenyang Lyu, Shouling Ji, Yuwei Li, Junfeng Zhou, Jianhai Chen, Jing, Chen

TL;DR
SmartSeed employs machine learning to generate high-value seed files rapidly, significantly enhancing fuzzing efficiency and vulnerability discovery across multiple applications and input formats.
Contribution
It introduces a novel machine learning-based seed generation system that outperforms existing strategies in fuzzing efficiency and vulnerability detection.
Findings
Generates high-value seeds within tens of seconds
Significantly improves fuzzing performance and crash discovery
Discovers 16 new vulnerabilities with assigned CVE IDs
Abstract
Fuzzing is an automated application vulnerability detection method. For genetic algorithm-based fuzzing, it can mutate the seed files provided by users to obtain a number of inputs, which are then used to test the objective application in order to trigger potential crashes. As shown in existing literature, the seed file selection is crucial for the efficiency of fuzzing. However, current seed selection strategies do not seem to be better than randomly picking seed files. Therefore, in this paper, we propose a novel and generic system, named SmartSeed, to generate seed files towards efficient fuzzing. Specifically, SmartSeed is designed based on a machine learning model to learn and generate high-value binary seeds. We evaluate SmartSeed along with American Fuzzy Lop (AFL) on 12 open-source applications with the input formats of mp3, bmp or flv. We also combine SmartSeed with different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Advanced Malware Detection Techniques · Software Reliability and Analysis Research
