LARGER: Lexically Anchored Repository Graph Exploration and Retrieval
Yuntong Hu, Tongli Su, Liang Zhao, Bowen Zhu, Hasibul Haque

TL;DR
LARGER is a framework that enhances repository navigation for coding agents by converting lexical matches into structural context, improving localization and understanding without external graph tools.
Contribution
It introduces a lexically anchored active-set retrieval method that integrates structural graph exploration into existing search loops for better code repository localization.
Findings
Improves file-level Acc@5 on LocBench by +13.9 points with tuned hyperparameters.
Gains +11.8 points on LocBench with fixed hyperparameters over the strongest baseline.
Delivers consistent improvements across multiple benchmarks including MuLocBench and SWE-Atlas.
Abstract
Repository-level coding agents must first localize the files and symbols relevant to a task; failures at this stage can cascade across downstream objectives ranging from patch generation to test writing and codebase question answering. Existing agents navigate repositories primarily through lexical search, often missing structural relations such as imports, call chains, type hierarchies, and code-test links. Graph-based retrieval can recover such dependencies, but existing approaches often require separate graph tools or traversal stages that fragment the agent's interaction loop. We formalize repository context localization as Lexically Anchored Structural Localization, where success depends on turning lexical matches into high-precision structural entry points and exposing the most useful confidence-filtered local neighborhoods within the agent's existing search loop. We introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
