FedeKD: Energy-Based Gating for Robust Federated Knowledge Distillation under Heterogeneous Settings

Quang-Huy Nguyen; Jiaqi Wang; Wei-shinn Ku

arXiv:2605.05553·cs.LG·May 8, 2026

FedeKD: Energy-Based Gating for Robust Federated Knowledge Distillation under Heterogeneous Settings

Quang-Huy Nguyen, Jiaqi Wang, Wei-shinn Ku

PDF

TL;DR

FedeKD introduces an energy-based gating mechanism in federated knowledge distillation to improve robustness by estimating sample-wise trust without relying on public data, effectively reducing negative transfer in heterogeneous environments.

Contribution

The paper proposes a novel reliability-aware FKD framework with an energy-based gating mechanism for sample-wise trust estimation, enhancing robustness without needing public datasets.

Findings

01

FedeKD significantly reduces negative transfer in heterogeneous federated settings.

02

The energy-based gating mechanism improves sample-wise knowledge transfer.

03

FedeKD maintains strong predictive performance across six real-world datasets.

Abstract

Federated learning (FL) operates in heterogeneous environments, where variations in data distributions and asymmetric model design often result in negative transfer. While federated knowledge distillation (FKD) avoids direct model parameter sharing, existing methods typically rely on public datasets or assume that transferred knowledge is uniformly reliable, which limits their robustness in practice. This paper presents FedeKD, a reliability-aware FKD framework that makes sample-wise trust estimation an explicit component of knowledge transfer, without relying on additional public data. Each client maintains a high-capacity private model for local learning and a lightweight shared proxy model for cross-client knowledge exchange. During training, proxy models are aggregated on the server to form a global proxy, which is then used to guide updates of the private models. At the core of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.