Non-Convex Over-the-Air Heterogeneous Federated Learning: A Bias-Variance Trade-off
Muhammad Faraz Ul Abrar, Nicol\`o Michelusi

TL;DR
This paper introduces a novel OTA-FL framework for non-convex models under wireless heterogeneity, balancing bias and variance to enhance convergence and generalization.
Contribution
It develops a structured bias-variance trade-off in OTA-FL with non-convex objectives and proposes an SCA-based power control algorithm using only statistical CSI.
Findings
Accelerates convergence compared to prior OTA-FL methods.
Improves model generalization in non-convex image classification tasks.
Validates effectiveness through experiments on real-world data.
Abstract
Over-the-air (OTA) federated learning (FL) has been well recognized as a scalable paradigm that exploits the waveform superposition of the wireless multiple-access channel to aggregate model updates in a single use. Existing OTA-FL designs largely enforce zero-bias model updates by either assuming \emph{homogeneous} wireless conditions (equal path loss across devices) or forcing zero-bias updates to guarantee convergence. Under \emph{heterogeneous} wireless scenarios, however, such designs are constrained by the weakest device and inflate the update variance. Moreover, prior analyses of biased OTA-FL largely address convex objectives, while most modern AI models are highly non-convex. Motivated by these gaps, we study OTA-FL with stochastic gradient descent (SGD) for general smooth non-convex objectives under wireless heterogeneity. We develop novel OTA-FL SGD updates that allow a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
