Do Code Review Measures Explain the Incidence of Post-Release Defects?
Andrey Krutauz, Tapajit Dey, Peter C. Rigby, Audris Mockus

TL;DR
This study investigates whether code review measures can explain post-release defects by replicating prior research, applying Bayesian Network models, and analyzing data from Qt and Google Chrome to understand predictor influences.
Contribution
It demonstrates that code review measures have limited direct impact on post-release defects and highlights the importance of prior defects, module size, and authorship as key predictors.
Findings
Models without review predictors perform as well or better.
Review measures influence defects indirectly, not directly.
Prior defects, size, and authorship are strong predictors.
Abstract
Aim: In contrast to studies of defects found during code review, we aim to clarify whether code reviews measures can explain the prevalence of post-release defects. Method: We replicate a study by McIntoshet. al that uses additive regression to model the relationship between defects and code reviews. To increase external validity, we apply the same methodology on a new software project. We discuss our findings with the first author of the original study, McIntosh. We then investigate how to reduce the impact of correlated predictors in the variable selection process and how to increase understanding of the inter-relationships among the predictors by employing Bayesian Network (BN) models. Context: As in the original study, we use the same measures authors obtained for Qt project in the original study. We mine data from version control and issue tracker of Google Chrome and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
