Multi-CoLoR: Context-Aware Localization and Reasoning across Multi-Language Codebases
Indira Vats, Sanjukta De, Subhayan Roy, Saurabh Bodhe, Lejin Varghese, Max Kiehn, Yonas Bedasso, Marsha Chechik

TL;DR
Multi-CoLoR is a novel framework that enhances code localization in multi-language repositories by combining organizational knowledge retrieval with graph-based reasoning, significantly improving accuracy and efficiency.
Contribution
It introduces a two-stage approach integrating issue context retrieval with graph traversal for better multi-language code localization.
Findings
Improves localization accuracy over baseline methods.
Reduces search space and tool calls in code retrieval.
Effective across diverse programming languages.
Abstract
Large language models demonstrate strong capabilities in code generation but struggle to navigate complex, multi-language repositories to locate relevant code. Effective code localization requires understanding both organizational context (e.g., historical issue-fix patterns) and structural relationships within heterogeneous codebases. Existing methods either (i) focus narrowly on single-language benchmarks, (ii) retrieve code across languages via shallow textual similarity, or (iii) assume no prior context. We present Multi-CoLoR, a framework for Context-aware Localization and Reasoning across Multi-Language codebases, which integrates organizational knowledge retrieval with graph-based reasoning to traverse complex software ecosystems. Multi-CoLoR operates in two stages: (i) a similar issue context (SIC) module retrieves semantically and organizationally related historical issues to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Machine Learning in Materials Science · Topic Modeling
