Predicting Line-Level Defects by Capturing Code Contexts with Hierarchical Transformers
Parvez Mahbub, Mohammad Masudur Rahman

TL;DR
Bugsplorer is a hierarchical transformer-based deep learning model that predicts defective lines in code with significantly improved accuracy, helping prioritize software quality assurance efforts effectively.
Contribution
The paper introduces Bugsplorer, a novel hierarchical transformer model specifically designed for line-level defect prediction, outperforming existing techniques.
Findings
Bugsplorer achieves 26-72% better accuracy than state-of-the-art methods.
It ranks the top 20% defective lines within the top 1-3% suspicious lines.
The approach can significantly reduce software quality assurance costs.
Abstract
Software defects consume 40% of the total budget in software development and cost the global economy billions of dollars every year. Unfortunately, despite the use of many software quality assurance (SQA) practices in software development (e.g., code review, continuous integration), defects may still exist in the official release of a software product. Therefore, prioritizing SQA efforts for the vulnerable areas of the codebase is essential to ensure the high quality of a software release. Predicting software defects at the line level could help prioritize the SQA effort but is a highly challenging task given that only ~3% of lines of a codebase could be defective. Existing works on line-level defect prediction often fall short and cannot fully leverage the line-level defect information. In this paper, we propose Bugsplorer, a novel deep-learning technique for line-level defect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Software Reliability and Analysis Research
