Improving IR-Based Bug Localization with Context-Aware Query Reformulation
Mohammad Masudur Rahman, Chanchal K. Roy

TL;DR
The paper introduces BLIZZARD, a novel bug localization method that adaptively reformulates queries based on the bug report's information richness, significantly improving IR-based bug localization accuracy.
Contribution
BLIZZARD is the first technique to automatically determine and apply appropriate query reformulations for bug localization based on report content.
Findings
Achieves 7-56% higher Hit@10 compared to baseline.
Improves MAP@10 and MRR@10 by up to 62%.
Outperforms state-of-the-art techniques by 19% in MAP@10.
Abstract
Recent findings suggest that Information Retrieval (IR)-based bug localization techniques do not perform well if the bug report lacks rich structured information (eg relevant program entity names). Conversely, excessive structured information (eg stack traces) in the bug report might not always help the automated localization either. In this paper, we propose a novel technique--BLIZZARD-- that automatically localizes buggy entities from project source using appropriate query reformulation and effective information retrieval. In particular, our technique determines whether there are excessive program entities or not in a bug report (query), and then applies appropriate reformulations to the query for bug localization. Experiments using 5,139 bug reports show that our technique can localize the buggy source documents with 7%--56% higher Hit@10, 6%--62% higher MAP@10 and 6%--62% higher…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Advanced Malware Detection Techniques · Web Application Security Vulnerabilities
