CodeNav: Beyond tool-use to using real-world codebases with LLM agents

Tanmay Gupta; Luca Weihs; Aniruddha Kembhavi

arXiv:2406.12276·cs.AI·June 19, 2024

CodeNav: Beyond tool-use to using real-world codebases with LLM agents

Tanmay Gupta, Luca Weihs, Aniruddha Kembhavi

PDF

Open Access

TL;DR

CodeNav is an LLM agent that intelligently navigates and leverages unseen code repositories to solve complex user queries, surpassing traditional tool-use methods by automatically indexing, searching, and utilizing code snippets.

Contribution

It introduces a novel LLM agent that automatically interacts with real-world codebases, reducing manual tool registration and enhancing code understanding capabilities.

Findings

01

CodeNav effectively solves complex queries across diverse codebases.

02

It outperforms traditional tool-use LLM agents in benchmark tests.

03

Source code access improves agent performance more than natural language descriptions.

Abstract

We present CodeNav, an LLM agent that navigates and leverages previously unseen code repositories to solve user queries. In contrast to tool-use LLM agents that require ``registration'' of all relevant tools via manual descriptions within the LLM context, CodeNav automatically indexes and searches over code blocks in the target codebase, finds relevant code snippets, imports them, and uses them to iteratively generate a solution with execution feedback. To highlight the core-capabilities of CodeNav, we first showcase three case studies where we use CodeNav for solving complex user queries using three diverse codebases. Next, on three benchmarks, we quantitatively compare the effectiveness of code-use (which only has access to the target codebase) to tool-use (which has privileged access to all tool names and descriptions). Finally, we study the effect of varying kinds of tool and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Rights Management and Security

MethodsLib