D&C: A Divide-and-Conquer Approach to IR-based Bug Localization
Anil Koyuncu, Tegawend\'e F. Bissyand\'e, Dongsun Kim, Kui Liu,, Jacques Klein, Martin Monperrus, Yves Le Traon

TL;DR
This paper introduces D&C, a divide-and-conquer machine learning approach that improves IR-based bug localization accuracy by training multiple classifiers on specific bug report sets, outperforming existing tools.
Contribution
The paper presents a novel learning-based method that assigns optimal weights to similarity measures, significantly enhancing bug localization performance over state-of-the-art tools.
Findings
D&C achieves higher MAP and MRR scores than existing tools.
D&C locates around 50% of bugs at Top1, 77% at Top5, and 85% at Top10.
The approach demonstrates stable and substantial improvements in bug localization accuracy.
Abstract
Many automated tasks in software maintenance rely on information retrieval techniques to identify specific information within unstructured data. Bug localization is such a typical task, where text in a bug report is analyzed to identify file locations in the source code that can be associated to the reported bug. Despite the promising results, the performance offered by IR-based bug localization tools is still not significant for large adoption. We argue that one reason could be the attempt to build a one-size-fits-all approach. In this paper, we extensively study the performance of state-of-the-art bug localization tools, focusing on query formulation and its importance with respect to the localization performance. Building on insights from this study, we propose a new learning approach where multiple classifier models are trained on clear-cut sets of bug-location pairs. Concretely, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Robotics and Automated Systems · Video Analysis and Summarization
