AI-Assisted Fixes to Code Review Comments at Scale

Chandra Maddila; Negar Ghorbani; James Saindon; Parth Thakkar; Vijayaraghavan Murali; Rui Abreu; Jingyue Shen; Brian Zhou; Nachiappan Nagappan; and Peter C. Rigby

arXiv:2507.13499·cs.SE·July 21, 2025

AI-Assisted Fixes to Code Review Comments at Scale

Chandra Maddila, Negar Ghorbani, James Saindon, Parth Thakkar, Vijayaraghavan Murali, Rui Abreu, Jingyue Shen, Brian Zhou, Nachiappan Nagappan, and Peter C. Rigby

PDF

Open Access

TL;DR

This paper presents MetaMateCR, an AI system that provides automated fixes for code review comments at Meta, demonstrating improved accuracy and safety through extensive offline and production evaluations.

Contribution

We developed and deployed MetaMateCR, an AI-powered tool for code review comment fixes, with a large-scale benchmark, safety trials, and real-world deployment at Meta.

Findings

01

LargeLSFT model achieves 68% exact match patches, outperforming GPT-4o.

02

Safety trials show UX modifications prevent review time regressions.

03

Production results show a 19.7% actionable-to-applied patch rate.

Abstract

Aim. There are 10s of thousands of code review comments each week at Meta. We developed Metamate for Code Review (MetaMateCR) that provides AI-assisted fixes for reviewer comments in production at scale. Method. We developed an internal benchmark of 64k <review comment, patch> data points to fine-tune Llama models. Once our models achieve reasonable offline results, we roll them into production. To ensure that our AI-assisted fixes do not negatively impact the time it takes to do code reviews, we conduct randomized controlled safety trials as well as full production experiments. Offline Results. As a baseline, we compare GPT-4o to our small and large Llama models. In offline results, our LargeLSFT model creates an exact match patch 68% of the time outperforming GPT-4o by 9 percentage points (pp). The internal models also use more modern Hack functions when compared to the PHP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Artificial Intelligence in Healthcare and Education