K-Detector: Identifying Duplicate Crash Failures in Large-Scale Software   Delivery

Hao Yang; Yang Xu; Yong Li; Hyun-Deok Choi

arXiv:2205.15972·cs.SE·June 2, 2022

K-Detector: Identifying Duplicate Crash Failures in Large-Scale Software Delivery

Hao Yang, Yang Xu, Yong Li, Hyun-Deok Choi

PDF

1 Repo

TL;DR

K-Detector is a machine learning-based tool designed to automatically identify duplicate crash failures in large-scale software delivery, significantly reducing manual effort and improving efficiency.

Contribution

It introduces a novel training-based mathematical model utilizing component information for crash similarity comparison and demonstrates its effectiveness in real-world deployment.

Findings

01

Achieved 0.986 AUC in crash duplicate detection

02

Saved 97% human effort in crash triage

03

Validated on 11,208 samples

Abstract

After a developer submits code, corresponding test cases arise to ensure the quality of software delivery. Test failures would occur during this period, such as crash, error, and timeout. Since it takes time for developers to resolve them, many duplicate failures will happen during this period. In the delivery practice of SAP HANA, crash triage is considered as the most time-consuming task. If duplicate crash failures can be automatically identified, the degree of automation will be significantly enhanced. To find such duplicates, we propose a training-based mathematical model that utilizes component information of SAP HANA to achieve better crash similarity comparison. We implement our approach in a tool named Knowledge-based Detector (K-Detector), which is verified by 11,208 samples and performs 0.986 in AUC. Furthermore, we have deployed K-Detector to the production environment, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SAP/crash-similarity-inspector
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.