Impact of LLM-based Review Comment Generation in Practice: A Mixed Open-/Closed-source User Study
Doriane Olewicki, Leuson Da Silva, Suhaib Mujahid, Arezou Amini,, Benjamin Mah, Marco Castelluccio, Sarra Habchi, Foutse Khomh, Bram Adams

TL;DR
This study evaluates the acceptance and impact of LLM-generated review comments in real-world software review settings across open and closed-source organizations, showing moderate acceptance and comparable effectiveness to human comments.
Contribution
It provides the first large-scale empirical user study on LLM-generated review comments in live environments, demonstrating their acceptance and influence on the review process.
Findings
8.1% and 7.2% of generated comments accepted by reviewers
Refactoring comments more likely to be accepted than functional ones
Generated comments are as effective as human comments in prompting future revisions
Abstract
We conduct a large-scale empirical user study in a live setup to evaluate the acceptance of LLM-generated comments and their impact on the review process. This user study was performed in two organizations, Mozilla (which has its codebase available as open source) and Ubisoft (fully closed-source). Inside their usual review environment, participants were given access to RevMate, an LLM-based assistive tool suggesting generated review comments using an off-the-shelf LLM with Retrieval Augmented Generation to provide extra code and review context, combined with LLM-as-a-Judge, to auto-evaluate the generated comments and discard irrelevant cases. Based on more than 587 patch reviews provided by RevMate, we observed that 8.1% and 7.2%, respectively, of LLM-generated comments were accepted by reviewers in each organization, while 14.6% and 20.5% other comments were still marked as valuable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTranslation Studies and Practices · Wikis in Education and Collaboration · Expert finding and Q&A systems
