Revisiting Neural Program Smoothing for Fuzzing

Maria-Irina Nicolae; Max Eisele; Andreas Zeller

arXiv:2309.16618·cs.SE·September 29, 2023

Revisiting Neural Program Smoothing for Fuzzing

Maria-Irina Nicolae, Max Eisele, Andreas Zeller

PDF

3 Repos

TL;DR

This paper critically evaluates neural program smoothing (NPS) fuzzers, revealing their limitations compared to standard fuzzers, and introduces improved methods and benchmarking tools for machine learning-based fuzzing.

Contribution

It provides the first extensive evaluation of NPS fuzzers, analyzes the role of machine learning in them, and offers new guidelines and tools for benchmarking ML-based fuzzers.

Findings

01

Original NPS performance claims are not supported by extensive testing.

02

Addressing practical limitations improves NPS performance but standard fuzzers still outperform them.

03

Introduces MLFuzz platform for reproducible benchmarking of ML-based fuzzers.

Abstract

Testing with randomly generated inputs (fuzzing) has gained significant traction due to its capacity to expose program vulnerabilities automatically. Fuzz testing campaigns generate large amounts of data, making them ideal for the application of machine learning (ML). Neural program smoothing (NPS), a specific family of ML-guided fuzzers, aims to use a neural network as a smooth approximation of the program target for new test case generation. In this paper, we conduct the most extensive evaluation of NPS fuzzers against standard gray-box fuzzers (>11 CPU years and >5.5 GPU years), and make the following contributions: (1) We find that the original performance claims for NPS fuzzers do not hold; a gap we relate to fundamental, implementation, and experimental limitations of prior works. (2) We contribute the first in-depth analysis of the contribution of machine learning and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.