Previously on... Automating Code Review

Robert Heum\"uller; Frank Ortmeier

arXiv:2508.18003·cs.SE·August 26, 2025

Previously on... Automating Code Review

Robert Heum\"uller, Frank Ortmeier

PDF

TL;DR

This paper systematically analyzes recent research on automating Modern Code Review using machine learning, highlighting challenges, variability, and the need for standardization to guide future advancements in the field.

Contribution

It provides the first comprehensive review of MCR automation research, formalizes tasks, identifies methodological challenges, and offers recommendations for standardization and improved evaluation practices.

Findings

01

Significant variability in task definitions and datasets

02

Limited reuse of datasets across studies

03

Identified challenges like temporal bias in evaluations

Abstract

Modern Code Review (MCR) is a standard practice in software engineering, yet it demands substantial time and resource investments. Recent research has increasingly explored automating core review tasks using machine learning (ML) and deep learning (DL). As a result, there is substantial variability in task definitions, datasets, and evaluation procedures. This study provides the first comprehensive analysis of MCR automation research, aiming to characterize the field's evolution, formalize learning tasks, highlight methodological challenges, and offer actionable recommendations to guide future research. Focusing on the primary code review tasks, we systematically surveyed 691 publications and identified 24 relevant studies published between May 2015 and April 2024. Each study was analyzed in terms of tasks, models, metrics, baselines, results, validity concerns, and artifact…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.