Preservation of the Global Knowledge by Not-True Distillation in Federated Learning
Gihun Lee, Minchan Jeong, Yongjin Shin, Sangmin Bae, Se-Young Yun

TL;DR
This paper introduces FedNTD, a federated learning algorithm that mitigates forgetting by preserving knowledge of not-true classes, improving global model convergence amid data heterogeneity without extra privacy or communication costs.
Contribution
The paper proposes FedNTD, a novel method that reduces forgetting of not-true classes in federated learning, enhancing model performance under data heterogeneity.
Findings
FedNTD achieves state-of-the-art results across various setups.
It effectively mitigates forgetting of not-true classes.
No additional privacy or communication costs incurred.
Abstract
In federated learning, a strong global model is collaboratively learned by aggregating clients' locally trained models. Although this precludes the need to access clients' data directly, the global model's convergence often suffers from data heterogeneity. This study starts from an analogy to continual learning and suggests that forgetting could be the bottleneck of federated learning. We observe that the global model forgets the knowledge from previous rounds, and the local training induces forgetting the knowledge outside of the local distribution. Based on our findings, we hypothesize that tackling down forgetting will relieve the data heterogeneity problem. To this end, we propose a novel and effective algorithm, Federated Not-True Distillation (FedNTD), which preserves the global perspective on locally available data only for the not-true classes. In the experiments, FedNTD shows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning
