Code Review Automation Via Multi-task Federated LLM -- An Empirical   Study

Jahnavi Kumar; Sridhar Chimalakonda

arXiv:2412.15676·cs.SE·December 23, 2024

Code Review Automation Via Multi-task Federated LLM -- An Empirical Study

Jahnavi Kumar, Sridhar Chimalakonda

PDF

Open Access

TL;DR

This paper investigates multi-task federated large language models for automating code review, aiming to improve efficiency and robustness by leveraging task relationships and federated learning, with a detailed empirical analysis of training techniques.

Contribution

It introduces a multi-task federated LLM approach for code review automation and compares different training strategies, highlighting the effectiveness of cumulative fine-tuning.

Findings

01

Sequential training causes catastrophic forgetting and is less efficient.

02

Cumulative fine-tuning outperforms separate models for each task.

03

Multi-task federated LLMs require further research for optimal fine-tuning.

Abstract

Code review is a crucial process before deploying code to production, as it validates the code, provides suggestions for improvements, and identifies errors such as missed edge cases. In projects with regular production releases, the effort required for peer code-reviews remains high. Consequently, there has been significant interest from software engineering (SE) researchers in automating the code review process. Previous research on code review automation has typically approached the task as three independent sub-tasks: review necessity prediction, review comment generation, and code refinement. Our study attempts to (i) leverage the relationships between the sub-tasks of code review automation, by developing a multi-task model that addresses all tasks in an integrated manner, and (ii) increase model robustness on unseen data via collaborative large language model (LLM) modeling,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Scientific Computing and Data Management · Software Engineering Research