Predicting Usefulness of Code Review Comments using Textual Features and Developer Experience
Mohammad Masudur Rahman, and Chanchal K. Roy, Raula G. Kula

TL;DR
This paper analyzes textual and experience factors influencing code review comment usefulness and introduces RevHelper, a model that predicts comment usefulness with 66% accuracy to assist developers.
Contribution
It provides the first comparative analysis of useful versus non-useful review comments and develops RevHelper, a predictive tool to enhance review comment quality.
Findings
Useful comments share more vocabulary with code changes
Reviewers' experience correlates with comment usefulness
RevHelper predicts comment usefulness with 66% accuracy
Abstract
Although peer code review is widely adopted in both commercial and open source development, existing studies suggest that such code reviews often contain a significant amount of non-useful review comments. Unfortunately, to date, no tools or techniques exist that can provide automatic support in improving those non-useful comments. In this paper, we first report a comparative study between useful and non-useful review comments where we contrast between them using their textual characteristics, and reviewers' experience. Then, based on the findings from the study, we develop RevHelper, a prediction model that can help the developers improve their code review comments through automatic prediction of their usefulness during review submission. Comparative study using 1,116 review comments suggested that useful comments share more vocabulary with the changed code, contain salient items like…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Open Source Software Innovations · Wikis in Education and Collaboration
