iCoRe: An Iterative Correlation-Aware Retriever for Bug Reproduction Test Generation
Junyi Wang, Jialun Cao, Zhongxin Liu

TL;DR
iCoRe is an iterative, correlation-aware retrieval method designed to improve bug reproduction test generation by addressing limitations of existing retrieval strategies, leading to significant accuracy improvements.
Contribution
The paper introduces iCoRe, a novel retrieval approach that explicitly models correlations between code, test cases, semantics, and feedback, enhancing bug reproduction test generation.
Findings
Achieves 42.0% and 52.8% fail-to-pass rates on benchmarks, improving over existing methods.
Effectively models correlations between code, test cases, semantics, and feedback.
Demonstrates significant relative improvements in bug reproduction accuracy.
Abstract
Automatically generating bug reproduction tests (BRT) from issue descriptions is crucial for software maintenance. LLM-based approaches have shown great potential for this task. Their effectiveness heavily relies on retrieving high-quality context from the codebase. The retrieval phase of existing approaches relies on either traditional methods like BM25 or LLM-driven strategies. LLM-based retrieval strategies typically equip an LLM with tools to autonomously explore the repository or select the most relevant files and code snippets from a provided list as context. However, these retrieval methods suffer from three key limitations: 1) They often employ a unified strategy for retrieving both source code and test cases, overlooking their distinct retrieval requirements. 2) They focus solely on semantic similarity while ignoring function call relationships, leading to irrelevant context.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
