Ensemble Distillation for Robust Model Fusion in Federated Learning
Tao Lin, Lingjing Kong, Sebastian U. Stich, Martin Jaggi

TL;DR
This paper introduces ensemble distillation for federated learning, enabling flexible aggregation of heterogeneous models, leading to faster training with fewer communication rounds while maintaining privacy and efficiency.
Contribution
It proposes a novel ensemble distillation method for model fusion in federated learning, allowing aggregation of diverse models without structural constraints.
Findings
Faster training of server model with fewer communication rounds
Effective aggregation of heterogeneous models in FL
Maintains privacy and reduces communication costs
Abstract
Federated Learning (FL) is a machine learning setting where many devices collaboratively train a machine learning model while keeping the training data decentralized. In most of the current training schemes the central model is refined by averaging the parameters of the server model and the updated parameters from the client side. However, directly averaging model parameters is only possible if all models have the same structure and size, which could be a restrictive constraint in many scenarios. In this work we investigate more powerful and more flexible aggregation schemes for FL. Specifically, we propose ensemble distillation for model fusion, i.e. training the central classifier through unlabeled data on the outputs of the models from the clients. This knowledge distillation technique mitigates privacy risk and cost to the same extent as the baseline FL algorithms, but allows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Traffic Prediction and Management Techniques · Adversarial Robustness in Machine Learning
MethodsKnowledge Distillation
