FedProxy: Federated Fine-Tuning of LLMs via Proxy SLMs and Heterogeneity-Aware Fusion

Tao Fan; Guoqiang Ma; Yuanfeng Song; Lixin Fan; Kai Chen; Qiang Yang

arXiv:2604.19015·cs.LG·April 22, 2026

FedProxy: Federated Fine-Tuning of LLMs via Proxy SLMs and Heterogeneity-Aware Fusion

Tao Fan, Guoqiang Ma, Yuanfeng Song, Lixin Fan, Kai Chen, Qiang Yang

PDF

TL;DR

FedProxy introduces a federated fine-tuning framework for LLMs that uses proxy models and heterogeneity-aware fusion to address privacy, IP protection, and performance challenges.

Contribution

It proposes a novel three-stage architecture with server-guided compression, interference-mitigating aggregation, and a plug-in fusion mechanism for secure, high-performance federated LLM adaptation.

Findings

01

FedProxy outperforms existing OT methods significantly.

02

Approaches centralized training performance in federated settings.

03

Establishes new benchmarks for secure federated LLM fine-tuning.

Abstract

Federated fine-tuning of Large Language Models (LLMs) is obstructed by a trilemma of challenges: protecting LLMs intellectual property (IP), ensuring client privacy, and mitigating performance loss on heterogeneous data. Existing methods like Offsite-Tuning (OT) secure the LLMs IP by having clients train only lightweight adapters, yet our analysis reveals they suffer from a fundamental performance bottleneck, leaving a significant gap compared to centralized training. To bridge this gap, we introduce FedProxy, a new federated adaptation framework. FedProxy replaces weak adapters with a unified, powerful Proxy Small Language Model (SLM), compressed from the proprietary LLM, to serve as a high-fidelity surrogate for collaborative fine-tuning. Our framework systematically resolves the trilemma through a three-stage architecture: (i) Efficient Representation via server-guided compression to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.