Optimized Federated Knowledge Distillation with Distributed Neural Architecture Search
Chaimaa Medjadji, Sylvain Kubler, Yves Le Traon, Guilain Leduc, Sadi Alawadi, and Feras M. Awaysheh

TL;DR
FedKDNAS introduces a federated learning framework that enables clients to select lightweight models and distill knowledge, improving accuracy and efficiency in heterogeneous, real-world scenarios.
Contribution
It combines client-side neural architecture search with knowledge distillation, allowing adaptive model selection and communication-efficient federated learning.
Findings
Achieves up to 15% accuracy improvement under non-IID data.
Reduces client CPU usage by approximately 28%.
Decreases communication overhead by up to 44 times.
Abstract
Federated Learning (FL) enables collaborative model training without centralizing data. However, real-world deployments must simultaneously address statistical heterogeneity across client data (non-IID), system heterogeneity in device capabilities, and communication efficiency. Existing FL approaches mitigate these challenges through improved aggregation, personalization, or knowledge distillation, but they almost universally assume a fixed client architecture, limiting adaptability to heterogeneous data complexity and hardware constraints. This architectural constraint often leads to suboptimal trade-offs between accuracy and efficiency in real-world FL systems. This work introduces FedKDNAS, a distillation-driven FL framework that combines client-side neural architecture selection with distillation of server-coordinated knowledge. Each client autonomously selects a lightweight model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
