RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph
Siru Ouyang, Wenhao Yu, Kaixin Ma, Zilin Xiao, Zhihan Zhang, Mengzhao Jia, Jiawei Han, Hongming Zhang, Dong Yu

TL;DR
RepoGraph is a plug-in module for repository-level code understanding. It gives AI software engineering systems structured guidance and repository-wide navigation, substantially boosting existing systems and setting a new state of the art on SWE-bench.
Contribution
Introducing RepoGraph, a novel plugin that manages repository-level code structure, enhancing AI software engineering and outperforming existing methods on key benchmarks.
Findings
Substantially boosts performance of existing AI coding systems
Achieves new state-of-the-art results on SWE-bench
Demonstrates extensibility on CrossCodeEval
Abstract
Large Language Models (LLMs) excel in code generation yet struggle with modern AI software engineering tasks. Unlike traditional function-level or file-level coding tasks, AI software engineering requires not only basic coding proficiency but also advanced skills in managing and interacting with code repositories. However, existing methods often overlook the need for repository-level code understanding, which is crucial for accurately grasping the broader context and developing effective solutions. On this basis, we present RepoGraph, a plug-in module that manages a repository-level structure for modern AI software engineering solutions. RepoGraph offers the desired guidance and serves as a repository-wide navigation for AI software engineers. We evaluate RepoGraph on the SWE-bench by plugging it into four different methods of two lines of approaches, where RepoGraph substantially…
Taxonomy
Topics: Software Engineering Research · Scientific Computing and Data Management · Software Testing and Debugging Techniques
