Automated Recovery of Issue-Commit Links Leveraging Both Textual and Non-textual Data
Pooya Rostami Mazrae, Maliheh Izadi, Abbas Heydarnoori

TL;DR
This paper introduces Hybrid-Linker, a novel approach that combines textual and non-textual data to improve the accuracy and reliability of automatically linking issues to commits in software development, outperforming existing methods.
Contribution
Hybrid-Linker is the first method to effectively integrate textual and non-textual information for commit-issue linking, significantly enhancing precision and recall over state-of-the-art approaches.
Findings
Hybrid-Linker achieves over 90% recall and precision.
It outperforms FRLink and DeepLink by over 30% in F-measure.
The approach is effective across multiple software projects.
Abstract
An issue documents discussions around required changes in issue-tracking systems, while a commit contains the change itself in the version control systems. Recovering links between issues and commits can facilitate many software evolution tasks such as bug localization, and software documentation. A previous study on over half a million issues from GitHub reports only about 42.2% of issues are manually linked by developers to their pertinent commits. Automating the linking of commit-issue pairs can contribute to the improvement of the said tasks. By far, current state-of-the-art approaches for automated commit-issue linking suffer from low precision, leading to unreliable results, sometimes to the point that imposes human supervision on the predicted links. The low performance gets even more severe when there is a lack of textual information in either commits or issues. Current…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Software Engineering Techniques and Practices
