FLoRIST: Singular Value Thresholding for Efficient and Accurate Federated Fine-Tuning of Large Language Models
Hariharan Ramesh, Jyotikrishna Dass

TL;DR
FLoRIST introduces a federated fine-tuning method for large language models that uses singular value thresholding to efficiently aggregate local adaptations, achieving high accuracy with low communication and computational costs.
Contribution
FLoRIST proposes a novel singular value decomposition-based aggregation method for federated LoRA, improving efficiency and accuracy without reconstructing large matrices.
Findings
Outperforms existing methods in communication efficiency and accuracy.
Works effectively across diverse datasets and model heterogeneity.
Achieves a good balance between performance and resource consumption.
Abstract
Integrating Low-Rank Adaptation (LoRA) into federated learning offers a promising solution for parameter-efficient fine-tuning of Large Language Models (LLMs) without sharing local data. However, several methods designed for federated LoRA present significant challenges in balancing communication efficiency, model accuracy, and computational cost, particularly among heterogeneous clients. These methods either rely on simplistic averaging of local adapters, which introduces aggregation noise, require transmitting large stacked local adapters, leading to poor communication efficiency, or necessitate reconstructing memory-dense global weight-update matrix and performing computationally expensive decomposition to design client-specific low-rank adapters. In this work, we propose FLoRIST, a federated fine-tuning framework that achieves mathematically accurate aggregation without incurring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Advanced Neural Network Applications
