Leveraging Models to Reduce Test Cases in Software Repositories

Golnaz Gharachorlu; Nick Sumner

arXiv:2103.11534·cs.SE·March 23, 2021

Leveraging Models to Reduce Test Cases in Software Repositories

Golnaz Gharachorlu, Nick Sumner

PDF

TL;DR

This paper introduces a model-guided approach to accelerate test case reduction in software repositories by predicting the semantic validity of candidates, significantly reducing trial counts and reduction time.

Contribution

It presents a novel model that predicts semantic properties to guide test case reduction, outperforming existing syntactic-only methods in efficiency.

Findings

01

30% reduction in reduction time

02

14% to 61% fewer removal trials

03

77% average precision in predictions

Abstract

Given a failing test case, test case reduction yields a smaller test case that reproduces the failure. This process can be time consuming due to repeated trial and error with smaller test cases. Current techniques speed up reduction by only exploring syntactically valid candidates, but they still spend significant effort on semantically invalid candidates. In this paper, we propose a model-guided approach to speed up test case reduction. The approach trains a model of semantic properties driven by syntactic test case properties. By using this model, we can skip testing even syntactically valid test case candidates that are unlikely to succeed. We evaluate this model-guided reduction on a suite of 14 large fuzzer-generated C test cases from the bug repositories of two well-known C compilers, GCC and Clang. Our results show that with an average precision of 77%, we can decrease the number…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.