RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair
Zhongqiang Pan, Chuanyi Li, Wenkang Zhong, Yi Feng, Bin Luo, Vincent Ng

TL;DR
RepoRepair introduces a novel approach that leverages generated hierarchical code documentation and large language models to improve repository-level program repair, significantly enhancing fault localization and repair success rates.
Contribution
RepoRepair presents a documentation-enhanced method that uses LLMs for global fault localization and repair at the repository level, addressing the limited context and shallow retrieval of existing APR techniques.
Findings
Achieves 45.7% repair rate on SWE-bench Lite
Attains 37.1% repair rate on SWE-bench Multimodal
Demonstrates cost-effective and robust performance across domains
Abstract
Automated program repair (APR) struggles to scale from isolated functions to full repositories, as it demands a global, task-aware understanding to locate necessary changes. Current methods, limited by context and reliant on shallow retrieval or costly agent iterations, falter on complex cross-file issues. To this end, we propose RepoRepair, a novel documentation-enhanced approach for repository-level fault localization and program repair. Our core insight is to leverage LLMs to generate hierarchical code documentation (from functions to files) for code repositories, creating structured semantic abstractions that enable LLMs to comprehend repository-level context and dependencies. Specifically, RepoRepair first employs a text-based LLM (e.g., DeepSeek-V3) to generate file/function-level code documentation for repositories, which serves as auxiliary knowledge to guide fault localization.…
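The documentation-guided localization idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the LLM call is replaced by a placeholder that reuses docstrings, ranking is plain token overlap rather than LLM reasoning, and every name here (`FileDoc`, `summarize_function`, `document_file`, `rank_files`) is hypothetical.

```python
import ast
import re
from dataclasses import dataclass, field


@dataclass
class FileDoc:
    """Two-level documentation for one file: per-function notes plus a file summary."""
    path: str
    function_docs: dict = field(default_factory=dict)  # function name -> summary
    summary: str = ""


def summarize_function(node) -> str:
    """Placeholder for an LLM call (assumption): first docstring line, else the signature."""
    doc = ast.get_docstring(node)
    if doc and doc.strip():
        return doc.strip().splitlines()[0]
    args = ", ".join(a.arg for a in node.args.args)
    return f"{node.name}({args})"


def document_file(path: str, source: str) -> FileDoc:
    """Build function-level docs first, then roll them up into a file-level summary."""
    file_doc = FileDoc(path=path)
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            file_doc.function_docs[node.name] = summarize_function(node)
    file_doc.summary = " ".join(file_doc.function_docs.values())
    return file_doc


def tokenize(text: str) -> set:
    """Lowercase alphabetic tokens; underscores split identifiers into words."""
    return set(re.findall(r"[a-z]+", text.lower()))


def rank_files(issue: str, docs: list) -> list:
    """Rank file paths by token overlap between the issue text and each file's docs."""
    issue_tokens = tokenize(issue)
    scored = []
    for d in docs:
        doc_text = d.summary + " " + " ".join(d.function_docs)
        scored.append((len(issue_tokens & tokenize(doc_text)), d.path))
    # Highest overlap first; ties broken alphabetically by path.
    return [path for _, path in sorted(scored, key=lambda s: (-s[0], s[1]))]


# Toy two-file repository (invented for illustration).
repo = {
    "dates.py": 'def parse_date(text):\n    """Parse an ISO date string into a date object."""\n    ...\n',
    "mail.py": 'def send_mail(to, body):\n    """Send an email message to a recipient."""\n    ...\n',
}
docs = [document_file(path, source) for path, source in repo.items()]
ranking = rank_files("Crash when parsing a date string with a timezone", docs)
```

In this toy setup the file whose documentation shares the most words with the issue text ranks first. A real system would presumably replace both the summarizer and the overlap score with LLM calls, as the abstract suggests.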
Taxonomy
Topics: Software Testing and Debugging Techniques · Software System Performance and Reliability · Software Engineering Research
