Communication-Efficient and Tensorized Federated Fine-Tuning of Large Language Models

Sajjad Ghiasvand; Yifan Yang; Zhiyu Xue; Mahnoosh Alizadeh; Zheng Zhang; Ramtin Pedarsani

arXiv:2410.13097·cs.LG·June 3, 2025·2 cites

Communication-Efficient and Tensorized Federated Fine-Tuning of Large Language Models

Sajjad Ghiasvand, Yifan Yang, Zhiyu Xue, Mahnoosh Alizadeh, Zheng Zhang, Ramtin Pedarsani

PDF

Open Access

TL;DR

This paper introduces FedTT and FedTT+ methods that enable communication-efficient, privacy-preserving federated fine-tuning of large language models using tensorized adapters, effectively handling data heterogeneity.

Contribution

The paper proposes novel tensorized adapter-based federated fine-tuning methods, FedTT and FedTT+, that reduce communication costs and improve robustness against data heterogeneity in federated learning.

Findings

01

Achieved up to 10× reduction in communication cost.

02

Performed on par or better than existing federated PEFT methods.

03

Demonstrated effectiveness on BERT and LLaMA models.

Abstract

Parameter-efficient fine-tuning (PEFT) methods typically assume that Large Language Models (LLMs) are trained on data from a single device or client. However, real-world scenarios often require fine-tuning these models on private data distributed across multiple devices. Federated Learning (FL) offers an appealing solution by preserving user privacy, as sensitive data remains on local devices during training. Nonetheless, integrating PEFT methods into FL introduces two main challenges: communication overhead and data heterogeneity. In this paper, we introduce FedTT and FedTT+, methods for adapting LLMs by integrating tensorized adapters into client-side models' encoder/decoder blocks. FedTT is versatile and can be applied to both cross-silo FL and large-scale cross-device FL. FedTT+, an extension of FedTT tailored for cross-silo FL, enhances robustness against data heterogeneity by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Softmax · Multi-Head Attention · WordPiece · Dropout · Layer Normalization · Adam · Attention Dropout · Attention Is All You Need