Optimal Transceiver Design in Over-the-Air Federated Distillation
Zihao Hu (1), Jia Yan (2), Ying-Jun Angela Zhang (1), Jun Zhang (3), Khaled B. Letaief (3) ((1) The Chinese University of Hong Kong, (2) The Hong Kong University of Science, Technology (Guangzhou), (3) The Hong Kong University of Science, Technology)

TL;DR
This paper introduces an over-the-air federated distillation framework that reduces communication overhead in federated learning by sharing model outputs instead of parameters, with optimized transceiver design to enhance learning convergence.
Contribution
It proposes a novel over-the-air federated distillation method with analytical convergence analysis and optimal transceiver design, improving communication efficiency in federated learning.
Findings
Significant reduction in communication overhead.
Minor compromise in testing accuracy.
Optimal transceiver solutions derived and validated.
Abstract
The rapid proliferation and growth of artificial intelligence (AI) has led to the development of federated learning (FL). FL allows wireless devices (WDs) to cooperatively learn by sharing only local model parameters, without needing to share the entire dataset. However, the emergence of large AI models has made existing FL approaches inefficient, due to the significant communication overhead required. In this paper, we propose a novel over-the-air federated distillation (FD) framework by synergizing the strength of FL and knowledge distillation to avoid the heavy local model transmission. Instead of sharing the model parameters, only the WDs' model outputs, referred to as knowledge, are shared and aggregated over-the-air by exploiting the superposition property of the multiple-access channel. We shall study the transceiver design in over-the-air FD, aiming to maximize the learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
