LinkAnchor: An Autonomous LLM-Based Agent for Issue-to-Commit Link Recovery
Arshia Akhavan, Alireza Hoseinpour, Abbas Heydarnoori, Hamid Bagheri, Mehdi Keshani

TL;DR
LinkAnchor is an autonomous LLM-based agent that improves issue-to-commit link recovery by dynamically retrieving relevant data, overcoming context limitations and modeling complex commit chains.
Contribution
It introduces a lazy-access architecture enabling LLMs to process relevant data dynamically, addressing limitations of previous pairwise and context-restricted methods.
Findings
Achieves more accurate link recovery by considering commit chains.
Reduces computational costs by selective data retrieval.
Addresses context window limitations of LLMs in software traceability.
Abstract
Issue-to-commit link recovery in software repositories is fundamental to software traceability and project management, yet it remains a challenging task. Prior studies show that only about 42.2% of issues on GitHub are correctly linked to their commits, highlighting the need for more effective solutions. Existing work has explored a range of ML/DL approaches, and more recently, large language models (LLMs) have been applied to this problem. However, these methods face two major limitations. First, LLMs are restricted by limited context windows and cannot simultaneously process all available data sources, such as long commit histories, extensive issue discussions, and large code repositories. Second, most approaches operate on individual issue-commit pairs, where a model independently scores the relevance of a single commit to an issue. This pairwise formulation fails to account for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
