A Comprehensive Empirical Investigation on Failure Clustering in Parallel Debugging
Yi Song, Xiaoyuan Xie, Quanming Liu, Xihao Zhang, Xi Wu

TL;DR
This paper empirically investigates how various factors affect failure clustering effectiveness in parallel debugging, providing insights into optimal conditions and limitations for fault isolation techniques.
Contribution
It is the first comprehensive empirical study examining the influence of four key factors on clustering effectiveness in parallel debugging.
Findings
GP19 is highly competitive across all REFs
Clustering effectiveness decreases as the number of faults increases
Higher effectiveness is easier with predicate faults
Abstract
The clustering technique has attracted a lot of attention as a promising strategy for parallel debugging in multi-fault scenarios, this heuristic approach (i.e., failure indexing or fault isolation) enables developers to perform multiple debugging tasks simultaneously through dividing failed test cases into several disjoint groups. When using statement ranking representation to model failures for better clustering, several factors influence clustering effectiveness, including the risk evaluation formula (REF), the number of faults (NOF), the fault type (FT), and the number of successful test cases paired with one individual failed test case (NSP1F). In this paper, we present the first comprehensive empirical study of how these four factors influence clustering effectiveness. We conduct extensive controlled experiments on 1060 faulty versions of 228 simulated faults and 141 real faults,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software System Performance and Reliability · Software Engineering Research
