Growing a Test Corpus with Bonsai Fuzzing

Vasudev Vikram; Rohan Padhye; Koushik Sen

arXiv:2103.04388·cs.SE·March 9, 2021

Growing a Test Corpus with Bonsai Fuzzing

Vasudev Vikram, Rohan Padhye, Koushik Sen

PDF

Open Access 1 Repo

TL;DR

Bonsai Fuzzing is a coverage-guided, grammar-based technique that incrementally constructs concise test inputs for programs, improving test corpus size efficiency while maintaining coverage and fault detection.

Contribution

This paper introduces Bonsai Fuzzing, a novel iterative deepening approach for growing concise test inputs directly, reducing test corpus size without sacrificing effectiveness.

Findings

01

Test inputs are 16-45% smaller on average.

02

Maintains similar code coverage as traditional fuzzing.

03

Achieves comparable fault detection capabilities.

Abstract

This paper presents a coverage-guided grammar-based fuzzing technique for automatically generating a corpus of concise test inputs for programs such as compilers. We walk-through a case study of a compiler designed for education and the corresponding problem of generating meaningful test cases to provide to students. The prior state-of-the-art solution is a combination of fuzzing and test-case reduction techniques such as variants of delta-debugging. Our key insight is that instead of attempting to minimize convoluted fuzzer-generated test inputs, we can instead grow concise test inputs by construction using a form of iterative deepening. We call this approach Bonsai Fuzzing. Experimental results show that Bonsai Fuzzing can generate test corpora having inputs that are 16--45% smaller in size on average as compared to a fuzz-then-reduce approach, while achieving approximately the same…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vasumv/bonsai-fuzzing
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software Reliability and Analysis Research