On The Importance of Reasoning for Context Retrieval in Repository-Level   Code Editing

Alexander Kovrigin; Aleksandra Eliseeva; Yaroslav Zharov; Timofey; Bryksin

arXiv:2406.04464·cs.SE·June 10, 2024

On The Importance of Reasoning for Context Retrieval in Repository-Level Code Editing

Alexander Kovrigin, Aleksandra Eliseeva, Yaroslav Zharov, Timofey, Bryksin

PDF

Open Access 1 Repo

TL;DR

This paper investigates the role of reasoning in context retrieval for repository-level code editing, highlighting its benefits and limitations in improving context precision and sufficiency.

Contribution

It decouples context retrieval from other components, providing insights into reasoning's impact and outlining the role of specialized tools in codebase navigation.

Findings

01

Reasoning improves context precision.

02

Current methods lack ability to determine context sufficiency.

03

Specialized tools are crucial for effective context gathering.

Abstract

Recent advancements in code-fluent Large Language Models (LLMs) enabled the research on repository-level code editing. In such tasks, the model navigates and modifies the entire codebase of a project according to request. Hence, such tasks require efficient context retrieval, i.e., navigating vast codebases to gather relevant context. Despite the recognized importance of context retrieval, existing studies tend to approach repository-level coding tasks in an end-to-end manner, rendering the impact of individual components within these complicated systems unclear. In this work, we decouple the task of context retrieval from the other components of the repository-level code editing pipelines. We lay the groundwork to define the strengths and weaknesses of this component and the role that reasoning plays in it by conducting experiments that focus solely on context retrieval. We conclude…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jetbrains-research/ai-agents-code-editing
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Data Mining and Analysis

MethodsFocus