On the Informativeness of Security Commit Messages: A Large-scale Replication Study
Syful Islam, Stefano Zacchiroli

TL;DR
This study confirms that security commit messages are generally uninformative for patch triage, and finds that informativeness is declining over time and varies across ecosystems, with some best practices being counterproductive.
Contribution
It replicates prior findings on commit message informativeness, extends analysis over a longer period, compares different ecosystems, and evaluates the impact of best practices like CCS.
Findings
Commit messages are generally uninformative for security purposes.
Informativeness has worsened over the extended time period.
CCS-compliant commits are less informative than non-compliant ones.
Abstract
The informativeness of security-related commit messages is crucial for patch triage: when high, it enables the rapid distribution and deployment of security fixes. Prior research (Reis et al., 2023) reported, however, that commit messages are often too uninformative to support these activities. To assess the robustness of this negative result, we independently replicate the original study using only the information provided in the paper, without reusing any of the original artifacts (data, analysis pipeline, etc.). We retrieve \num{50673} security-related commits and analyze their informativeness using an independent re-implementation of the techniques introduced by Reis et al. For the same source (i.e., GitHub) and time period (from June 1999 to August 2022) as the original study, our replication confirms the original findings in a statistically significant way: security-related commit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
