Learning Software Bug Reports: A Systematic Literature Review

Guoming Long; Jingzhi Gong; Hui Fang; Tao Chen

arXiv:2507.04422 · cs.SE · July 22, 2025

TL;DR

This systematic review analyzes 204 papers on machine learning techniques for bug report analysis, highlighting current trends, common methods, and future research directions in the field.

Contribution

It provides a comprehensive synthesis of ML-based bug report analysis research, identifying prevalent techniques, challenges, and gaps in the literature.

Findings

01

CNN, LSTM, and kNN are widely used for bug report analysis.

02

Word2Vec and TF-IDF are popular feature representations.

03

Evaluation commonly uses F1-score, Recall, and cross-validation.
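To make the three findings above concrete, here is a minimal sketch that chains TF-IDF features, a kNN classifier, and Recall/F1 scoring under leave-one-out cross-validation. It is not taken from the reviewed papers: the toy reports, the labels (1 = bug, 0 = feature request), and all function names are hypothetical, and the IDF weights are computed on the full corpus for brevity, which a real experiment would avoid.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (term -> weight) per document."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed IDF; always >= 1, so no vector is all zeros.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1.0 for t, df in doc_freq.items()}
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({t: (c / len(tokens)) * idf[t] for t, c in counts.items()})
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_leave_one_out(vectors, labels, k=1):
    """Predict each report's label from its k most similar other reports."""
    predictions = []
    for i, vec in enumerate(vectors):
        others = [j for j in range(len(vectors)) if j != i]
        nearest = sorted(others, key=lambda j: cosine(vec, vectors[j]), reverse=True)[:k]
        votes = Counter(labels[j] for j in nearest)
        predictions.append(votes.most_common(1)[0][0])
    return predictions


def recall_and_f1(y_true, y_pred, positive=1):
    """Compute Recall and F1 for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return recall, f1


# Hypothetical toy data: 1 = bug report, 0 = feature request.
REPORTS = [
    "crash on startup null pointer",
    "crash when opening file null pointer",
    "add dark mode option",
    "add option for export",
    "startup crash after update",
    "please add dark theme option",
]
LABELS = [1, 1, 0, 0, 1, 0]

predictions = knn_leave_one_out(tfidf_vectors(REPORTS), LABELS, k=1)
recall, f1 = recall_and_f1(LABELS, predictions)
print(f"recall={recall:.2f} f1={f1:.2f}")
```

Leave-one-out is used here only because the toy corpus is tiny; the studies surveyed more typically report k-fold cross-validation on datasets such as Eclipse and Mozilla.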

Abstract

The recent advancement of artificial intelligence, especially machine learning (ML), has significantly impacted software engineering research, including bug report analysis. ML aims to automate the understanding, extraction, and correlation of information from bug reports. Despite its growing importance, there has been no comprehensive review in this area. In this paper, we present a systematic literature review covering 1,825 papers, selecting 204 for detailed analysis. We derive seven key findings: 1) Extensive use of CNN, LSTM, and $k$NN for bug report analysis, with advanced models like BERT underutilized due to their complexity. 2) Word2Vec and TF-IDF are popular for feature representation, with a rise in deep learning approaches. 3) Stop word removal is the most common preprocessing, with structural methods rising after 2020. 4) Eclipse and Mozilla are the most frequently…
