FedSODA: Federated Fine-tuning of LLMs via Similarity Group Pruning and Orchestrated Distillation Alignment

Manning Zhu; Songtao Guo; Pengzhan Zhou; Yansong Ning; Chang Han; Dewen Qiao

arXiv:2508.12727·cs.LG·August 19, 2025

FedSODA: Federated Fine-tuning of LLMs via Similarity Group Pruning and Orchestrated Distillation Alignment

Manning Zhu, Songtao Guo, Pengzhan Zhou, Yansong Ning, Chang Han, Dewen Qiao

PDF

Open Access

TL;DR

FedSODA introduces a resource-efficient federated fine-tuning framework for large language models, utilizing similarity group pruning and orchestrated distillation to reduce resource demands while maintaining or improving performance.

Contribution

The paper proposes FedSODA, a novel FFT method that prunes redundant layers and aligns distillation, enabling efficient adaptation of LLMs on resource-constrained clients.

Findings

01

Reduces communication overhead by 70.6%.

02

Decreases storage usage by 75.6%.

03

Improves task accuracy by 3.1%.

Abstract

Federated fine-tuning (FFT) of large language models (LLMs) has recently emerged as a promising solution to enable domain-specific adaptation while preserving data privacy. Despite its benefits, FFT on resource-constrained clients relies on the high computational and memory demands of full-model fine-tuning, which limits the potential advancement. This paper presents FedSODA, a resource-efficient FFT framework that enables clients to adapt LLMs without accessing or storing the full model. Specifically, we first propose a similarity group pruning (SGP) module, which prunes redundant layers from the full LLM while retaining the most critical layers to preserve the model performance. Moreover, we introduce an orchestrated distillation alignment (ODA) module to reduce gradient divergence between the sub-LLM and the full LLM during FFT. Through the use of the QLoRA, clients only need to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Scheduling and Optimization Algorithms