Similar Pattern Annotation via Retrieval Knowledge for LLM-Based Test Code Fault Localization
Golnaz Gharachorlu, Mahsa Panahandeh, Lionel C. Briand, Ruifeng Gao, Ruiyuan Wan

TL;DR
SPARK enhances large language model-based test code fault localization by retrieving and annotating similar fault patterns from a knowledge corpus, improving accuracy in industrial settings.
Contribution
It introduces a retrieval-augmented framework that leverages debugging knowledge to guide LLMs in fault localization for test scripts, addressing scalability and complexity.
Findings
SPARK outperforms existing LLM-based TCFL methods in accuracy.
It maintains similar inference costs and token usage.
It effectively identifies multiple faults in complex test cases.
Abstract
Software failures remain a major challenge in modern software development, and identifying the code elements responsible for failures is a time-consuming debugging task. While extensive research has focused on fault localization in the system under test (SUT), failures can also originate from faulty system test scripts. This problem, known as Test Code Fault Localization (TCFL), has received significantly less attention despite its importance in continuous integration (CI) environments where large test suites are executed frequently. TCFL is particularly challenging because it typically operates under black-box conditions, relies on limited diagnostic signals such as error messages and partial logs, and involves large system-level test scripts that expand the fault localization search space. In this paper, we propose SPARK, a framework that integrates accumulated debugging knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
