FedH2L: Federated Learning with Model and Statistical Heterogeneity
Yiying Li, Wei Zhou, Huaimin Wang, Haibo Mi, Timothy M. Hospedales

TL;DR
FedH2L introduces a federated learning method that handles model and data heterogeneity by using mutual distillation, enabling efficient, privacy-preserving, decentralized training across diverse participants.
Contribution
It proposes a novel federated learning framework that is model-agnostic and robust to non-IID data, using mutual distillation instead of parameter sharing.
Findings
Achieves effective global models with heterogeneous architectures.
Maintains high performance on diverse data distributions.
Reduces communication bandwidth compared to parameter-sharing methods.
Abstract
Federated learning (FL) enables distributed participants to collectively learn a strong global model without sacrificing their individual data privacy. Mainstream FL approaches require each participant to share a common network architecture and further assume that data are are sampled IID across participants. However, in real-world deployments participants may require heterogeneous network architectures; and the data distribution is almost certainly non-uniform across participants. To address these issues we introduce FedH2L, which is agnostic to both the model architecture and robust to different data distributions across participants. In contrast to approaches sharing parameters or gradients, FedH2L relies on mutual distillation, exchanging only posteriors on a shared seed set between participants in a decentralized manner. This makes it extremely bandwidth efficient, model agnostic,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Human Mobility and Location-Based Analysis
