Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of   Large Language Models

Zihan Fang; Zheng Lin; Zhe Chen; Xianhao Chen; Yue Gao; Yuguang Fang

arXiv:2404.06448·cs.LG·December 9, 2024·3 cites

Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of Large Language Models

Zihan Fang, Zheng Lin, Zhe Chen, Xianhao Chen, Yue Gao, Yuguang Fang

PDF

Open Access

TL;DR

This paper introduces FedPipe, an automated federated pipeline that efficiently fine-tunes large language models on private data by identifying key weights, using low-rank adapters, and quantizing parameters, reducing training costs and latency.

Contribution

The paper presents FedPipe, a novel automated federated approach for parameter-efficient LLM fine-tuning that adapts to edge resource constraints and improves training speed and accuracy.

Findings

01

FedPipe accelerates LLM fine-tuning compared to existing methods.

02

It achieves higher accuracy in downstream tasks.

03

Reduces memory and computational costs significantly.

Abstract

Recently, there has been a surge in the development of advanced intelligent generative content (AIGC), especially large language models (LLMs). However, for many downstream tasks, it is necessary to fine-tune LLMs using private data. While federated learning offers a promising privacy-preserving solution to LLM fine-tuning, the substantial size of an LLM, combined with high computational and communication demands, makes it hard to apply to downstream tasks. More importantly, private edge servers often possess varying computing and network resources in real-world scenarios, introducing additional complexities to LLM fine-tuning. To tackle these problems, we design and implement an automated federated pipeline, named FedPipe, to fine-tune LLMs with minimal training cost but without adding any inference latency. FedPipe firstly identifies the weights to be fine-tuned based on their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsAdapter