Can Old Tests Do New Tricks for Resolving SWE Issues?
Yang Chen, Toufique Ahmed, Reyhaneh Jabbarvand, Martin Hirzel

TL;DR
TestPrune is an automated technique that reuses and minimizes regression tests to improve bug reproduction and patch validation, boosting issue resolution rates with minimal API costs.
Contribution
It introduces TestPrune, a novel method for automatically minimizing regression test suites to enhance bug reproduction and patch validation in bug repair pipelines.
Findings
TestPrune increases issue reproduction rate by up to 9%
TestPrune improves issue resolution rate by up to 13%
Minimal API cost overhead of $0.02-$0.05 per instance
Abstract
Test suites in real-world projects are often large and achieve high code coverage, yet they remain insufficient for detecting all bugs. The abundance of unresolved issues in open-source project trackers highlights this gap. While regression tests are typically designed to ensure past functionality is preserved in the new version, they can also serve a complementary purpose: debugging the current version. Specifically, regression tests can (1) enhance the generation of reproduction tests for newly reported issues, and (2) validate that patches do not regress existing functionality. We present TestPrune, a fully automated technique that leverages issue tracker reports and strategically reuses regression tests for both bug reproduction and patch validation. A key contribution of TestPrune is its ability to automatically minimize the regression suite to a small, highly relevant subset of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
