Communication-Efficient Stochastic Zeroth-Order Optimization for Federated Learning
Wenzhi Fang, Ziyi Yu, Yuning Jiang, Yuanming Shi, Colin N. Jones, and, Yong Zhou

TL;DR
This paper introduces a communication-efficient zeroth-order federated learning algorithm that works without gradient information, suitable for black-box scenarios, and enhances efficiency through local updates and over-the-air computation.
Contribution
It proposes the FedZO algorithm for derivative-free federated learning, including an AirComp-assisted version for wireless networks, with theoretical convergence analysis and practical validation.
Findings
FedZO converges under non-convex, non-i.i.d. data.
Over-the-air FedZO maintains convergence with proper transceiver design.
Simulation confirms effectiveness and theoretical insights.
Abstract
Federated learning (FL), as an emerging edge artificial intelligence paradigm, enables many edge devices to collaboratively train a global model without sharing their private data. To enhance the training efficiency of FL, various algorithms have been proposed, ranging from first-order to second-order methods. However, these algorithms cannot be applied in scenarios where the gradient information is not available, e.g., federated black-box attack and federated hyperparameter tuning. To address this issue, in this paper we propose a derivative-free federated zeroth-order optimization (FedZO) algorithm featured by performing multiple local updates based on stochastic gradient estimators in each communication round and enabling partial device participation. Under non-convex settings, we derive the convergence performance of the FedZO algorithm on non-independent and identically distributed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced MIMO Systems Optimization · Stochastic Gradient Optimization Techniques
