Deep Learning-based Code Reviews: A Paradigm Shift or a Double-Edged Sword?
Rosalia Tufano, Alberto Martin-Lopez, Ahmad Tayeb, Ozren Dabi\'c,, Sonia Haiduc, Gabriele Bavota

TL;DR
This study investigates the impact of deep learning-generated code reviews on expert reviewers, revealing that while they identify many issues and influence review focus, they do not save time or boost confidence.
Contribution
The paper provides empirical evidence on how automated deep learning-based code reviews affect review quality, effort, and confidence, highlighting both benefits and limitations.
Findings
Reviewers find most issues identified by LLMs valid.
Automated reviews influence reviewers to focus on indicated code areas.
No significant time savings or confidence increase observed.
Abstract
Several techniques have been proposed to automate code review. Early support consisted in recommending the most suited reviewer for a given change or in prioritizing the review tasks. With the advent of deep learning in software engineering, the level of automation has been pushed to new heights, with approaches able to provide feedback on source code in natural language as a human reviewer would do. Also, recent work documented open source projects adopting Large Language Models (LLMs) as co-reviewers. Although the research in this field is very active, little is known about the actual impact of including automatically generated code reviews in the code review process. While there are many aspects worth investigating, in this work we focus on three of them: (i) review quality, i.e., the reviewer's ability to identify issues in the code; (ii) review cost, i.e., the time spent reviewing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Natural Language Processing Techniques · Topic Modeling
