MASFuzzer: Fuzz Driver Generation and Adaptive Scheduling via Multidimensional API Sequences
Xingyu Liu, Zengqin Huang, Xiang Gao, Hailong Sun

TL;DR
MASFUZZER enhances library fuzz testing by synthesizing multidimensional API sequences and employing adaptive scheduling, leading to higher code coverage and discovery of new vulnerabilities.
Contribution
It introduces a novel fuzzing framework that combines API sequence synthesis with adaptive scheduling to improve driver effectiveness and code coverage.
Findings
Achieves 8.54% higher code coverage than state-of-the-art methods.
Uncovers 16 previously unknown vulnerabilities, with 14 confirmed and 9 assigned CVEs.
Effectively generates drivers that explore untested code regions.
Abstract
Fuzz testing of software libraries relies on fuzz drivers to invoke library APIs. Traditionally, these drivers are written manually by developers - a process that is time-consuming and often inadequate for exercising complex program behaviors. While recent studies have explored the use of Large Language Models (LLMs) to automate fuzz driver generation, the resulting drivers often fail to cover deep program branches. To address these challenges, we propose MASFUZZER, a fuzzing framework that integrates multidimensional API sequence construction with adaptive fuzzing scheduling strategies to improve library testing. At its core, MASFUZZER synthesizes context-relevant API call sequences by referring to API usage examples from the codebase and applying mutation-propagation-based and semantic-aware API sequence mining. These multidimensional API sequences serve as the basis for LLMs to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
