Code Review Automation Via Multi-task Federated LLM -- An Empirical Study
Jahnavi Kumar, Sridhar Chimalakonda

TL;DR
This paper investigates multi-task federated large language models for automating code review, aiming to improve efficiency and robustness by leveraging task relationships and federated learning, with a detailed empirical analysis of training techniques.
Contribution
It introduces a multi-task federated LLM approach for code review automation and compares different training strategies, highlighting the effectiveness of cumulative fine-tuning.
Findings
Sequential training causes catastrophic forgetting and is less efficient.
Cumulative fine-tuning outperforms separate models for each task.
Multi-task federated LLMs require further research for optimal fine-tuning.
Abstract
Code review is a crucial process before deploying code to production, as it validates the code, provides suggestions for improvements, and identifies errors such as missed edge cases. In projects with regular production releases, the effort required for peer code-reviews remains high. Consequently, there has been significant interest from software engineering (SE) researchers in automating the code review process. Previous research on code review automation has typically approached the task as three independent sub-tasks: review necessity prediction, review comment generation, and code refinement. Our study attempts to (i) leverage the relationships between the sub-tasks of code review automation, by developing a multi-task model that addresses all tasks in an integrated manner, and (ii) increase model robustness on unseen data via collaborative large language model (LLM) modeling,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Scientific Computing and Data Management · Software Engineering Research
