Towards a Larger Model via One-Shot Federated Learning on Heterogeneous Client Models

Wenxuan Ye; Xueli An; Onur Ayan; Junfan Wang; Xueqiang Yan; Georg Carle

arXiv:2508.13625·cs.LG·August 20, 2025

Towards a Larger Model via One-Shot Federated Learning on Heterogeneous Client Models

Wenxuan Ye, Xueli An, Onur Ayan, Junfan Wang, Xueqiang Yan, Georg Carle

PDF

TL;DR

FedOL enables one-shot federated learning of larger, more comprehensive server models by exchanging predictions instead of raw data or model weights, effectively handling heterogeneous client models and data biases.

Contribution

The paper introduces FedOL, a novel one-shot federated learning framework using knowledge distillation to build larger server models with heterogeneous client architectures.

Findings

01

FedOL outperforms existing baselines in simulation tests.

02

It reduces communication overhead by exchanging predictions.

03

It effectively handles data bias and model heterogeneity.

Abstract

Large models, renowned for superior performance, outperform smaller ones even without billion-parameter scales. While mobile network servers have ample computational resources to support larger models than client devices, privacy constraints prevent clients from directly sharing their raw data. Federated Learning (FL) enables decentralized clients to collaboratively train a shared model by exchanging model parameters instead of transmitting raw data. Yet, it requires a uniform model architecture and multiple communication rounds, which neglect resource heterogeneity, impose heavy computational demands on clients, and increase communication overhead. To address these challenges, we propose FedOL, to construct a larger and more comprehensive server model in one-shot settings (i.e., in a single communication round). Instead of model parameter sharing, FedOL employs knowledge distillation,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.