Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation   Models

Yae Jee Cho; Luyang Liu; Zheng Xu; Aldi Fahrezi; Gauri; Joshi

arXiv:2401.06432·cs.LG·February 22, 2024·1 cites

Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models

Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri, Joshi

PDF

Open Access

TL;DR

This paper introduces HetLoRA, a novel federated fine-tuning method for on-device foundation models that uses heterogeneous low-rank approximations to address data and system heterogeneity, improving convergence and efficiency.

Contribution

HetLoRA allows heterogeneous LoRA ranks across devices, enabling efficient federated fine-tuning of foundation models with improved convergence and performance.

Findings

01

HetLoRA outperforms homogeneous LoRA in convergence speed.

02

HetLoRA achieves better final performance on federated tasks.

03

HetLoRA enhances computational efficiency compared to full fine-tuning.

Abstract

Foundation models (FMs) adapt well to specific domains or tasks with fine-tuning, and federated learning (FL) enables the potential for privacy-preserving fine-tuning of the FMs with on-device local data. For federated fine-tuning of FMs, we consider the FMs with small to medium parameter sizes of single digit billion at maximum, referred to as on-device FMs (ODFMs) that can be deployed on devices for inference but can only be fine-tuned with parameter efficient methods. In our work, we tackle the data and system heterogeneity problem of federated fine-tuning of ODFMs by proposing a novel method using heterogeneous low-rank approximations (LoRAs), namely HetLoRA. First, we show that the naive approach of using homogeneous LoRA ranks across devices face a trade-off between overfitting and slow convergence, and thus propose HetLoRA, which allows heterogeneous ranks across client devices…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings