Previously on... Automating Code Review
Robert Heum\"uller, Frank Ortmeier

TL;DR
This paper systematically analyzes recent research on automating Modern Code Review using machine learning, highlighting challenges, variability, and the need for standardization to guide future advancements in the field.
Contribution
It provides the first comprehensive review of MCR automation research, formalizes tasks, identifies methodological challenges, and offers recommendations for standardization and improved evaluation practices.
Findings
Significant variability in task definitions and datasets
Limited reuse of datasets across studies
Identified challenges like temporal bias in evaluations
Abstract
Modern Code Review (MCR) is a standard practice in software engineering, yet it demands substantial time and resource investments. Recent research has increasingly explored automating core review tasks using machine learning (ML) and deep learning (DL). As a result, there is substantial variability in task definitions, datasets, and evaluation procedures. This study provides the first comprehensive analysis of MCR automation research, aiming to characterize the field's evolution, formalize learning tasks, highlight methodological challenges, and offer actionable recommendations to guide future research. Focusing on the primary code review tasks, we systematically surveyed 691 publications and identified 24 relevant studies published between May 2015 and April 2024. Each study was analyzed in terms of tasks, models, metrics, baselines, results, validity concerns, and artifact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
