Deep Learning-based Code Reviews: A Paradigm Shift or a Double-Edged Sword?

Rosalia Tufano; Alberto Martin-Lopez; Ahmad Tayeb; Ozren Dabić; Sonia Haiduc; Gabriele Bavota

arXiv:2411.11401·cs.SE·December 2, 2024

TL;DR

This study investigates the impact of deep learning-generated code reviews on expert reviewers, revealing that while they identify many issues and influence review focus, they do not save time or boost confidence.

Contribution

The paper provides empirical evidence on how automated deep learning-based code reviews affect review quality, effort, and confidence, highlighting both benefits and limitations.

Findings

01

Reviewers find most issues identified by LLMs valid.

02

Automated reviews influence reviewers to focus on indicated code areas.

03

No significant time savings or confidence increase observed.

Abstract

Several techniques have been proposed to automate code review. Early support consisted of recommending the most suitable reviewer for a given change or prioritizing the review tasks. With the advent of deep learning in software engineering, the level of automation has been pushed to new heights, with approaches able to provide feedback on source code in natural language as a human reviewer would. Recent work has also documented open source projects adopting Large Language Models (LLMs) as co-reviewers. Although research in this field is very active, little is known about the actual impact of including automatically generated code reviews in the code review process. While there are many aspects worth investigating, in this work we focus on three of them: (i) review quality, i.e., the reviewer's ability to identify issues in the code; (ii) review cost, i.e., the time spent reviewing…

Code & Models

Repositories

codereviewexperiment/code_review_controlled_experiment
none · Official

Taxonomy

Topics: Software Engineering Research · Natural Language Processing Techniques · Topic Modeling