Change And Cover: Last-Mile, Pull Request-Based Regression Test Augmentation
Zitong Zhou, Matteo Paltenghi, Miryung Kim, Michael Pradel

TL;DR
ChaCo is an LLM-based technique that generates targeted regression tests for pull requests, improving coverage of untested lines and integrating seamlessly into existing test suites, thus enhancing software quality efficiently.
Contribution
ChaCo introduces a PR-specific test augmentation method using LLMs, with techniques for extracting relevant test context and integrating tests into existing suites, addressing last-mile coverage gaps.
Findings
Achieves 30% full patch coverage on PRs
Tests are highly relevant and well integrated, rated 4.7/5
Exposes and fixes previously unknown bugs
Abstract
Software is in constant evolution, with developers frequently submitting pull requests (PRs) to introduce new features or fix bugs. Testing PRs is critical to maintaining software quality. Yet, even in projects with extensive test suites, some PR-modified lines remain untested, leaving a "last-mile" regression test gap. Existing test generators typically aim to improve overall coverage, but do not specifically target the uncovered lines in PRs. We present Change And Cover (ChaCo), an LLM-based test augmentation technique that addresses this gap. It makes three contributions: (i) ChaCo considers the PR-specific patch coverage, offering developers augmented tests for code just when it is on the developers' mind. (ii) We identify providing suitable test context as a crucial challenge for an LLM to generate useful tests, and present two techniques to extract relevant test content, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software System Performance and Reliability
