TL;DR
This systematic literature review analyzes machine learning-based test case selection and prioritization techniques in regression testing, highlighting current methods, evaluation metrics, performance, and reproducibility issues to guide future research.
Contribution
The paper provides an in-depth analysis of 29 ML-based TSP studies from 2006 to 2020, offering a taxonomy and insights for future research directions.
Findings
ML techniques improve test case prioritization accuracy
Evaluation metrics vary widely across studies
Reproducibility of results is often limited
Abstract
Regression testing is an essential activity to assure that software code changes do not adversely affect existing functionalities. With the wide adoption of Continuous Integration (CI) in software projects, which increases the frequency of running software builds, running all tests can be time-consuming and resource-intensive. To alleviate that problem, Test case Selection and Prioritization (TSP) techniques have been proposed to improve regression testing by selecting and prioritizing test cases in order to provide early feedback to developers. In recent years, researchers have relied on Machine Learning (ML) techniques to achieve effective TSP (ML-based TSP). Such techniques help combine information about test cases, from partial and imperfect sources, into accurate prediction models. This work conducts a systematic literature review focused on ML-based TSP techniques, aiming to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
