Delving into Commit-Issue Correlation to Enhance Commit Message Generation Models
Liran Wang, Xunzhu Tang, Yichen He, Changyu Ren, Shuhua Shi, Chaoran, Yan, Zhoujun Li

TL;DR
This paper introduces a new dataset and a training paradigm that leverage commit-issue correlations to significantly improve automated commit message generation models.
Contribution
It constructs the first dataset combining correlated commits and issues and proposes ool, a novel training paradigm that enhances models by incorporating commit-issue correlation.
Findings
ool-enhanced models outperform original models in experiments.
The dataset includes both unlabeled and human-annotated commit-issue pairs.
Significant performance improvements demonstrate the effectiveness of the approach.
Abstract
Commit message generation (CMG) is a challenging task in automated software engineering that aims to generate natural language descriptions of code changes for commits. Previous methods all start from the modified code snippets, outputting commit messages through template-based, retrieval-based, or learning-based models. While these methods can summarize what is modified from the perspective of code, they struggle to provide reasons for the commit. The correlation between commits and issues that could be a critical factor for generating rational commit messages is still unexplored. In this work, we delve into the correlation between commits and issues from the perspective of dataset and methodology. We construct the first dataset anchored on combining correlated commits and issues. The dataset consists of an unlabeled commit-issue parallel part and a labeled part in which each example…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Advanced Software Engineering Methodologies
