Tackling Data Heterogeneity in Federated Learning through Knowledge Distillation with Inequitable Aggregation
Xing Ma

TL;DR
This paper introduces KDIA, a novel federated learning strategy that effectively handles client data heterogeneity and limited client participation by leveraging knowledge distillation and weighted aggregation, leading to improved model accuracy.
Contribution
The paper proposes KDIA, a new method combining knowledge distillation with inequitable aggregation to address client heterogeneity and partial participation in federated learning.
Findings
KDIA outperforms existing methods in accuracy and convergence speed.
The approach is especially effective under severe data heterogeneity.
Experimental results on CIFAR datasets validate the method's robustness.
Abstract
Federated learning aims to train a global model in a distributed environment that is close to the performance of centralized training. However, issues such as client label skew, data quantity skew, and other heterogeneity problems severely degrade the model's performance. Most existing methods overlook the scenario where only a small portion of clients participate in training within a large-scale client setting, whereas our experiments show that this scenario presents a more challenging federated learning task. Therefore, we propose a Knowledge Distillation with teacher-student Inequitable Aggregation (KDIA) strategy tailored to address the federated learning setting mentioned above, which can effectively leverage knowledge from all clients. In KDIA, the student model is the average aggregation of the participating clients, while the teacher model is formed by a weighted aggregation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques
MethodsKnowledge Distillation
