Using Large-scale Heterogeneous Graph Representation Learning for Code Review Recommendations at Microsoft
Jiyang Zhang, Chandra Maddila, Ram Bairi, Christian Bird, Ujjwal, Raizada, Apoorva Agrawal, Yamini Jhawar, Kim Herzig, Arie van Deursen

TL;DR
This paper introduces CORAL, a graph neural network-based system that leverages rich socio-technical data to improve reviewer recommendations in large-scale software projects, outperforming traditional methods especially on larger repositories.
Contribution
The paper presents a novel graph convolutional neural network approach using socio-technical graphs for reviewer recommendation, addressing limitations of existing methods by incorporating diverse developer artifacts.
Findings
CORAL models reviewer selection history effectively.
CORAL identifies relevant reviewers missed by traditional systems.
Classical methods perform better on small projects, CORAL excels on large ones.
Abstract
Code review is an integral part of any mature software development process, and identifying the best reviewer for a code change is a well-accepted problem within the software engineering community. Selecting a reviewer who lacks expertise and understanding can slow development or result in more defects. To date, most reviewer recommendation systems rely primarily on historical file change and review information; those who changed or reviewed a file in the past are the best positioned to review in the future. We posit that while these approaches are able to identify and suggest qualified reviewers, they may be blind to reviewers who have the needed expertise and have simply never interacted with the changed files before. Fortunately, at Microsoft, we have a wealth of work artifacts across many repositories that can yield valuable information about our developers. To address the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Software System Performance and Reliability
MethodsCorrelation Alignment for Deep Domain Adaptation
