Test Case Prioritization Using Test Similarities
Alireza Haghighatkhah, Mika M\"antyl\"a, Markku Oivo, Pasi Kuvaja

TL;DR
This study evaluates similarity-based test prioritization methods, finding that dissimilar test cases early in testing improve defect detection, with a focus on effectiveness and speed trade-offs across techniques.
Contribution
It compares five SBTP techniques on real-world datasets, highlighting the effectiveness of dissimilarity-based prioritization and the efficiency of locality-sensitive hashing.
Findings
Dissimilar test cases early improve defect detection effectiveness.
No single SBTP technique is consistently superior in effectiveness.
Locality-sensitive hashing is faster but slightly less effective.
Abstract
A classical heuristic in software testing is to reward diversity, which implies that a higher priority must be assigned to test cases that differ the most from those already prioritized. This approach is commonly known as similarity-based test prioritization (SBTP) and can be realized using a variety of techniques. The objective of our study is to investigate whether SBTP is more effective at finding defects than random permutation, as well as determine which SBTP implementations lead to better results. To achieve our objective, we implemented five different techniques from the literature and conducted an experiment using the defects4j dataset, which contains 395 real faults from six real-world open-source Java programs. Findings indicate that running the most dissimilar test cases early in the process is largely more effective than random permutation (Vargha-Delaney A [VDA]: 0.76-0.99…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software System Performance and Reliability
