Tackling Data Heterogeneity in Federated Learning through Knowledge Distillation with Inequitable Aggregation

Xing Ma

arXiv:2506.20431·cs.LG·June 26, 2025

Tackling Data Heterogeneity in Federated Learning through Knowledge Distillation with Inequitable Aggregation

Xing Ma

PDF

Open Access 1 Repo

TL;DR

This paper introduces KDIA, a novel federated learning strategy that effectively handles client data heterogeneity and limited client participation by leveraging knowledge distillation and weighted aggregation, leading to improved model accuracy.

Contribution

The paper proposes KDIA, a new method combining knowledge distillation with inequitable aggregation to address client heterogeneity and partial participation in federated learning.

Findings

01

KDIA outperforms existing methods in accuracy and convergence speed.

02

The approach is especially effective under severe data heterogeneity.

03

Experimental results on CIFAR datasets validate the method's robustness.

Abstract

Federated learning aims to train a global model in a distributed environment that is close to the performance of centralized training. However, issues such as client label skew, data quantity skew, and other heterogeneity problems severely degrade the model's performance. Most existing methods overlook the scenario where only a small portion of clients participate in training within a large-scale client setting, whereas our experiments show that this scenario presents a more challenging federated learning task. Therefore, we propose a Knowledge Distillation with teacher-student Inequitable Aggregation (KDIA) strategy tailored to address the federated learning setting mentioned above, which can effectively leverage knowledge from all clients. In KDIA, the student model is the average aggregation of the participating clients, while the teacher model is formed by a weighted aggregation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maxiaum/kdia
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques

MethodsKnowledge Distillation