Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning
Arian Raje, Baris Askin, Divyansh Jhunjhunwala, Gauri Joshi

TL;DR
Ravan is an adaptive multi-head LoRA method for federated fine-tuning of large language models, improving accuracy while maintaining parameter efficiency across heterogeneous edge devices.
Contribution
It introduces a novel multi-head LoRA approach with trainable scaling factors, enhancing model expressivity and accuracy in federated learning settings.
Findings
Improves test accuracy by 2-8% over prior methods
Effective on vision and language benchmarks
Balances parameter efficiency and model performance
Abstract
Large language models (LLMs) have not yet effectively leveraged the vast amounts of edge-device data, and federated learning (FL) offers a promising paradigm to collaboratively fine-tune LLMs without transferring private edge data to the cloud. To operate within the computation and communication constraints of edge devices, recent literature on federated fine-tuning of LLMs proposes the use of low-rank adaptation (LoRA) and similar parameter-efficient methods. However, LoRA-based methods suffer from accuracy degradation in FL settings, primarily because of data and computational heterogeneity across clients. We propose Ravan, an adaptive multi-head LoRA method that balances parameter efficiency and model expressivity by reparameterizing the weight updates as the sum of multiple LoRA heads in which only the core matrices and their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · IoT and Edge/Fog Computing · Advanced Neural Network Applications
MethodsFocus
