KADEL: Knowledge-Aware Denoising Learning for Commit Message Generation
Wei Tao, Yucheng Zhou, Yanlin Wang, Hongyu Zhang, Haofen Wang,, Wenqiang Zhang

TL;DR
KADEL introduces a knowledge-aware denoising learning approach that leverages good-practice commit messages to improve automatic commit message generation, effectively handling noisy data and achieving state-of-the-art results.
Contribution
The paper proposes a novel method that utilizes good-practice commits for better training and introduces dynamic denoising to handle noisy data in commit message generation.
Findings
Achieves state-of-the-art performance on MCMD dataset.
Effectively leverages good-practice commits for training.
Dynamic denoising improves robustness against noisy data.
Abstract
Commit messages are natural language descriptions of code changes, which are important for software evolution such as code understanding and maintenance. However, previous methods are trained on the entire dataset without considering the fact that a portion of commit messages adhere to good practice (i.e., good-practice commits), while the rest do not. On the basis of our empirical study, we discover that training on good-practice commits significantly contributes to the commit message generation. Motivated by this finding, we propose a novel knowledge-aware denoising learning method called KADEL. Considering that good-practice commits constitute only a small proportion of the dataset, we align the remaining training samples with these good-practice commits. To achieve this, we propose a model that learns the commit knowledge by training on good-practice commits. This knowledge model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Web Data Mining and Analysis · Software System Performance and Reliability
MethodsALIGN
