GradualDiff-Fed: A Federated Learning Specialized Framework for Large Language Model

Amir Faiyaz; Tara Salman

arXiv:2506.19164·cs.LG·June 25, 2025

GradualDiff-Fed: A Federated Learning Specialized Framework for Large Language Model

Amir Faiyaz, Tara Salman

PDF

Open Access

TL;DR

GradualDiff-Fed introduces a federated learning framework for large language models that reduces communication costs by transmitting only weight differences, enabling efficient, privacy-preserving fine-tuning without performance loss.

Contribution

The paper presents GradualDiff-Fed, a novel FL framework tailored for LLMs that significantly cuts communication overhead while maintaining centralized training performance.

Findings

01

Achieves performance comparable to centralized training.

02

Reduces communication costs substantially.

03

Enables scalable, privacy-preserving fine-tuning of LLMs.

Abstract

The rapid proliferation of large language models (LLMs) has created an unprecedented demand for fine-tuning models for specialized domains, such as medical science. While federated learning (FL) offers a decentralized and privacy-preserving approach to collaboratively fine-tune LLMs without sharing raw data, it presents significant challenges, particularly in performance and managing large model sizes efficiently. In this paper, we introduce GradualDiff-Fed, an FL framework designed explicitly for LLMs, and their challenge of handling the high parameter size. GradualDiff-Fed reduces communication costs by transmitting only the difference of model weights rather than the entire model during training rounds. Such an approach significantly improves scalability and communication efficiency, making it more feasible to fine-tune LLMs across distributed clients without compromising…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data