Improving Test Distance for Failure Clustering with Hypergraph Modelling
Gabin An, Juyeon Yoon, Joyce Jiyoung Whang, Shin Yoo

TL;DR
This paper introduces Hybiscus, a hypergraph-based test distance metric that significantly improves failure clustering accuracy in automated debugging, enabling more effective fault localization in multi-fault scenarios.
Contribution
Hybiscus presents a novel hypergraph-based test distance metric that enhances failure clustering accuracy for multi-fault debugging, outperforming existing methods.
Findings
Achieved perfect clustering in 418 out of 605 test runs.
Enabled separation of different root causes for better fault localization.
Saved up to 82% of debugging effort compared to state-of-the-art techniques.
Abstract
Automated debugging techniques, such as Fault Localisation (FL) or Automated Program Repair (APR), are typically designed under the Single Fault Assumption (SFA). However, in practice, an unknown number of faults can independently cause multiple test case failures, making it difficult to allocate resources for debugging and to use automated debugging techniques. Clustering algorithms have been applied to group the test failures according to their root causes, but their accuracy can often be lacking due to the inherent limits in the distance metrics for test cases. We introduce a new test distance metric based on hypergraphs and evaluate their accuracy using multi-fault benchmarks that we have built on top of Defects4J and SIR. Results show that our technique, Hybiscus, can automatically achieve perfect clustering (i.e., the same number of clusters as the ground truth number of root…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software System Performance and Reliability
