Easy over Hard: A Simple Baseline for Test Failures Causes Prediction
Zhipeng Gao, Zhipeng Xue, Xing Hu, Weiyi Shang, Xin Xia

TL;DR
This paper introduces NCChecker, a simple yet effective method for automatically predicting test failure causes from logs, significantly reducing manual analysis time and handling data imbalance efficiently.
Contribution
The paper presents a novel approach combining log abstraction and a lookup table with heuristic rules for failure cause prediction, outperforming benchmarks.
Findings
High prediction accuracy on industrial datasets
Efficient and memory-saving approach
Effective handling of data imbalance
Abstract
The test failure causes analysis is critical since it determines the subsequent way of handling different types of bugs, which is the prerequisite to get the bugs properly analyzed and fixed. After a test case fails, software testers have to inspect the test execution logs line by line to identify its root cause. However, manual root cause determination is often tedious and time-consuming, which can cost 30-40% of the time needed to fix a problem. Therefore, there is a need for automatically predicting the test failure causes to lighten the burden of software testers. In this paper, we present a simple but hard-to-beat approach, named NCChecker to automatically identify the failure causes for failed test logs. Our approach can help developers efficiently identify the test failure causes, and flag the most probable log lines of indicating the root causes for investigation. Our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
