Easy over Hard: A Simple Baseline for Test Failures Causes Prediction

Zhipeng Gao; Zhipeng Xue; Xing Hu; Weiyi Shang; Xin Xia

arXiv:2405.02922·cs.SE·May 7, 2024

Easy over Hard: A Simple Baseline for Test Failures Causes Prediction

Zhipeng Gao, Zhipeng Xue, Xing Hu, Weiyi Shang, Xin Xia

PDF

TL;DR

This paper introduces NCChecker, a simple yet effective method for automatically predicting test failure causes from logs, significantly reducing manual analysis time and handling data imbalance efficiently.

Contribution

The paper presents a novel approach combining log abstraction and a lookup table with heuristic rules for failure cause prediction, outperforming benchmarks.

Findings

01

High prediction accuracy on industrial datasets

02

Efficient and memory-saving approach

03

Effective handling of data imbalance

Abstract

The test failure causes analysis is critical since it determines the subsequent way of handling different types of bugs, which is the prerequisite to get the bugs properly analyzed and fixed. After a test case fails, software testers have to inspect the test execution logs line by line to identify its root cause. However, manual root cause determination is often tedious and time-consuming, which can cost 30-40% of the time needed to fix a problem. Therefore, there is a need for automatically predicting the test failure causes to lighten the burden of software testers. In this paper, we present a simple but hard-to-beat approach, named NCChecker to automatically identify the failure causes for failed test logs. Our approach can help developers efficiently identify the test failure causes, and flag the most probable log lines of indicating the root causes for investigation. Our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.