Human-AI Synergy in Agentic Code Review
Suzhen Zhong, Shayan Noei, Ying Zou, Bram Adams

TL;DR
This study empirically compares human and AI reviewers in open-source code review, revealing AI's limitations in suggestion quality and the importance of human oversight for effective collaboration.
Contribution
It provides large-scale empirical evidence on human-AI collaboration patterns and the effectiveness of AI agents versus human reviewers in code review workflows.
Findings
Humans provide more diverse feedback than AI agents.
AI suggestions are adopted less frequently than human suggestions.
AI suggestions tend to increase code complexity more when adopted.
Abstract
Code review is a critical software engineering practice where developers review code changes before integration to ensure code quality, detect defects, and improve maintainability. In recent years, AI agents that can understand code context, plan review actions, and interact with development environments have been increasingly integrated into the code review process. However, there is limited empirical evidence to compare the effectiveness of AI agents and human reviewers in collaborative workflows. To address this gap, we conduct a large-scale empirical analysis of 278,790 code review conversations across 300 open-source GitHub projects. In our study, we aim to compare the feedback differences provided by human reviewers and AI agents. We investigate human-AI collaboration patterns in review conversations to understand how interaction shapes review outcomes. Moreover, we analyze the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Scientific Computing and Data Management · Software Engineering Techniques and Practices
