Leveraging Models to Reduce Test Cases in Software Repositories
Golnaz Gharachorlu, Nick Sumner

TL;DR
This paper introduces a model-guided approach to accelerate test case reduction in software repositories by predicting the semantic validity of candidates, significantly reducing trial counts and reduction time.
Contribution
It presents a novel model that predicts semantic properties to guide test case reduction, outperforming existing syntactic-only methods in efficiency.
Findings
30% reduction in reduction time
14% to 61% fewer removal trials
77% average precision in predictions
Abstract
Given a failing test case, test case reduction yields a smaller test case that reproduces the failure. This process can be time consuming due to repeated trial and error with smaller test cases. Current techniques speed up reduction by only exploring syntactically valid candidates, but they still spend significant effort on semantically invalid candidates. In this paper, we propose a model-guided approach to speed up test case reduction. The approach trains a model of semantic properties driven by syntactic test case properties. By using this model, we can skip testing even syntactically valid test case candidates that are unlikely to succeed. We evaluate this model-guided reduction on a suite of 14 large fuzzer-generated C test cases from the bug repositories of two well-known C compilers, GCC and Clang. Our results show that with an average precision of 77%, we can decrease the number…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
