Two Stage Wireless Federated LoRA Fine-Tuning with Sparsified Orthogonal Updates
Bumjun Kim, Wan Choi

TL;DR
This paper proposes a novel wireless federated LoRA fine-tuning framework with adaptive sparsification and a two-stage algorithm, significantly reducing communication costs while maintaining high accuracy in large language model training.
Contribution
It introduces SOFT, an adaptive sparsification method, and TSFA, a two-stage algorithm for efficient wireless federated LoRA fine-tuning, with theoretical convergence analysis.
Findings
Achieves comparable accuracy to ideal models
Reduces communication overhead significantly
Enables scalable large model deployment in wireless FL
Abstract
Transformer-based large language models (LLMs) have achieved remarkable success across various tasks. Yet, fine-tuning such massive models in federated learning (FL) settings poses significant challenges due to resource constraints and communication overhead. Low-Rank Adaptation (LoRA) addresses these issues by training compact, low-rank matrices instead of fully fine-tuning large models. This paper introduces a wireless federated LoRA fine-tuning framework that optimizes both learning performance and communication efficiency. We provide a novel convergence analysis, revealing how LoRA rank and covariance effects influence FL training dynamics. Leveraging these insights, we propose Sparsified Orthogonal Fine-Tuning (\textbf{SOFT}), an adaptive sparsification method that streamlines parameter updates without expensive matrix multiplications and singular value decomposition (SVD)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
