Personalized Collaborative Fine-Tuning for On-Device Large Language   Models

Nicolas Wagner; Dongyang Fan; Martin Jaggi

arXiv:2404.09753·cs.CL·August 8, 2024·1 cites

Personalized Collaborative Fine-Tuning for On-Device Large Language Models

Nicolas Wagner, Dongyang Fan, Martin Jaggi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel on-device collaborative fine-tuning method for large language models that leverages trust-weighted gradient aggregation and low-rank adaptation to improve performance with limited local data.

Contribution

It proposes three trust-based gradient aggregation schemes and integrates LoRA to reduce communication, outperforming existing methods in diverse data scenarios.

Findings

01

Outperforms FedAvg and local fine-tuning methods

02

Effective in heterogeneous and scarce data environments

03

Reduces communication overhead with LoRA

Abstract

We explore on-device self-supervised collaborative fine-tuning of large language models with limited local data availability. Taking inspiration from the collaborative learning community, we introduce three distinct trust-weighted gradient aggregation schemes: weight similarity-based, prediction similarity-based and validation performance-based. To minimize communication overhead, we integrate Low-Rank Adaptation (LoRA) and only exchange LoRA weight updates. Our protocols, driven by prediction and performance metrics, surpass both FedAvg and local fine-tuning methods, which is particularly evident in realistic scenarios with more diverse local data distributions. The results underscore the effectiveness of our approach in addressing heterogeneity and scarcity within local datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

epfml/personalized-collaborative-llms
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Recommender Systems and Techniques · Expert finding and Q&A systems