Industry-scale IR-based Bug Localization: A Perspective from Facebook
Vijayaraghavan Murali, Lee Gross, Rebecca Qian, Satish Chandra

TL;DR
This paper evaluates IR-based bug localization at Facebook's industrial scale, introduces Bug2Commit to address unique challenges, and demonstrates its superior accuracy over existing methods in real-world applications.
Contribution
The paper presents Bug2Commit, a novel IR-based bug localization tool tailored for industrial-scale, complex bug reports, and evaluates its effectiveness against existing techniques.
Findings
Bug2Commit outperforms existing IR methods by up to 17% in accuracy.
Effective in diverse Facebook applications like crash analysis and performance regressions.
Reduces bug triaging time and improves bug attribution in large-scale industrial settings.
Abstract
We explore the application of Information Retrieval (IR) based bug localization methods at a large industrial setting, Facebook. Facebook's code base evolves rapidly, with thousands of code changes being committed to a monolithic repository every day. When a bug is detected, it is often time-sensitive and imperative to identify the commit causing the bug in order to either revert it or fix it. This is complicated by the fact that bugs often manifest with complex and unwieldy features, such as stack traces and other metadata. Code commits also have various features associated with them, ranging from developer comments to test results. This poses unique challenges to bug localization methods, making it a highly non-trivial operation. In this paper we lay out several practical concerns for industry-level IR-based bug localization, and propose Bug2Commit, a tool that is designed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
