Towards a Larger Model via One-Shot Federated Learning on Heterogeneous Client Models
Wenxuan Ye, Xueli An, Onur Ayan, Junfan Wang, Xueqiang Yan, Georg Carle

TL;DR
FedOL enables one-shot federated learning of larger, more comprehensive server models by exchanging predictions instead of raw data or model weights, effectively handling heterogeneous client models and data biases.
Contribution
The paper introduces FedOL, a novel one-shot federated learning framework using knowledge distillation to build larger server models with heterogeneous client architectures.
Findings
FedOL outperforms existing baselines in simulation tests.
It reduces communication overhead by exchanging predictions.
It effectively handles data bias and model heterogeneity.
Abstract
Large models, renowned for superior performance, outperform smaller ones even without billion-parameter scales. While mobile network servers have ample computational resources to support larger models than client devices, privacy constraints prevent clients from directly sharing their raw data. Federated Learning (FL) enables decentralized clients to collaboratively train a shared model by exchanging model parameters instead of transmitting raw data. Yet, it requires a uniform model architecture and multiple communication rounds, which neglect resource heterogeneity, impose heavy computational demands on clients, and increase communication overhead. To address these challenges, we propose FedOL, to construct a larger and more comprehensive server model in one-shot settings (i.e., in a single communication round). Instead of model parameter sharing, FedOL employs knowledge distillation,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
