Convergence Analysis of Aggregation-Broadcast in LoRA-enabled Distributed Fine-Tuning
Xin Chen, Shuaijun Chen, Omid Tavallaie, Nguyen Tran, Shuhuang Xiang, Albert Zomaya

TL;DR
This paper provides a theoretical convergence analysis of aggregation strategies in LoRA-enabled federated learning, clarifying conditions for global and local model convergence and comparing their effectiveness.
Contribution
It introduces a unified convergence framework for LoRA-based federated learning, categorizes aggregation methods, and analyzes their convergence properties with theoretical guarantees.
Findings
Sum-Product aggregation satisfies weak convergence conditions.
Product-Sum aggregation satisfies strong convergence conditions.
Experimental results validate the theoretical convergence analysis.
Abstract
Federated Learning (FL) enables collaborative model training across decentralized data sources while preserving data privacy. However, the growing size of Machine Learning (ML) models poses communication and computation challenges in FL. Low-Rank Adaptation (LoRA) has recently been introduced into FL as an efficient fine-tuning method, reducing communication overhead by updating only a small number of trainable parameters. Despite its effectiveness, how to aggregate LoRA-updated local models on the server remains a critical and understudied problem. In this paper, we provide a unified convergence analysis for LoRA-based FL. We first categories the current aggregation method into two major type: Sum-Product (SP) and Product-Sum (PS). Then we formally define the Aggregation-Broadcast Operator (ABO) and derive both weak and strong convergence condition under mild assumptions. Furthermore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · IoT and Edge/Fog Computing · Stochastic Gradient Optimization Techniques
