GradualDiff-Fed: A Federated Learning Specialized Framework for Large Language Model
Amir Faiyaz, Tara Salman

TL;DR
GradualDiff-Fed introduces a federated learning framework for large language models that reduces communication costs by transmitting only weight differences, enabling efficient, privacy-preserving fine-tuning without performance loss.
Contribution
The paper presents GradualDiff-Fed, a novel FL framework tailored for LLMs that significantly cuts communication overhead while maintaining centralized training performance.
Findings
Achieves performance comparable to centralized training.
Reduces communication costs substantially.
Enables scalable, privacy-preserving fine-tuning of LLMs.
Abstract
The rapid proliferation of large language models (LLMs) has created an unprecedented demand for fine-tuning models for specialized domains, such as medical science. While federated learning (FL) offers a decentralized and privacy-preserving approach to collaboratively fine-tune LLMs without sharing raw data, it presents significant challenges, particularly in performance and managing large model sizes efficiently. In this paper, we introduce GradualDiff-Fed, an FL framework designed explicitly for LLMs, and their challenge of handling the high parameter size. GradualDiff-Fed reduces communication costs by transmitting only the difference of model weights rather than the entire model during training rounds. Such an approach significantly improves scalability and communication efficiency, making it more feasible to fine-tune LLMs across distributed clients without compromising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
