Fault Localization from the Semantic Code Search Perspective
Yihao Qin, Shangwen Wang, Yan Lei, Zhuo Zhang, Bo Lin, Xin Peng,, Liqian Chen, Xiaoguang Mao

TL;DR
This paper introduces CosFL, a novel fault localization method that leverages semantic code search and large language models to improve bug localization accuracy, significantly outperforming existing approaches.
Contribution
CosFL decomposes fault localization into query generation and fault retrieval, utilizing semantic understanding and multi-granularity search to enhance bug localization effectiveness.
Findings
Localized 324 bugs within Top-1 out of 835 bugs.
Outperformed state-of-the-art approaches by 26.6%-57.3%.
Effectively used LLMs for semantic code understanding.
Abstract
The software development process is characterized by an iterative cycle of continuous functionality implementation and debugging, essential for the enhancement of software quality and adaptability to changing requirements. This process incorporates two isolatedly studied tasks: Code Search (CS), which retrieves reference code from a code corpus to aid in code implementation, and Fault Localization (FL), which identifies code entities responsible for bugs within the software project to boost software debugging. These two tasks exhibit similarities since they both address search problems. Notably, CS techniques have demonstrated greater effectiveness than FL ones, possibly because of the precise semantic details of the required code offered by natural language queries, which are not readily accessible to FL methods. Drawing inspiration from this, we hypothesize that a fault localizer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Service-Oriented Architecture and Web Services
