Federated Distillation: A Survey
Lin Li, Jianping Gou, Baosheng Yu, Lan Du, Zhang Yiand, Dacheng Tao

TL;DR
Federated Distillation (FD) enhances federated learning by reducing communication costs and allowing flexible model architectures through knowledge distillation, broadening practical applications.
Contribution
This survey comprehensively reviews the latest advancements, principles, and applications of Federated Distillation in addressing federated learning challenges.
Findings
FD reduces communication costs in FL.
FD allows heterogeneous model architectures.
FD broadens application scenarios of FL.
Abstract
Federated Learning (FL) seeks to train a model collaboratively without sharing private training data from individual clients. Despite its promise, FL encounters challenges such as high communication costs for large-scale models and the necessity for uniform model architectures across all clients and the server. These challenges severely restrict the practical applications of FL. To address these limitations, the integration of knowledge distillation (KD) into FL has been proposed, forming what is known as Federated Distillation (FD). FD enables more flexible knowledge transfer between clients and the server, surpassing the mere sharing of model parameters. By eliminating the need for identical model architectures across clients and the server, FD mitigates the communication costs associated with training large-scale models. This paper aims to offer a comprehensive overview of FD,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Blockchain Technology Applications and Security · Cryptography and Data Security
MethodsKnowledge Distillation
