Communication and Energy Efficient Federated Learning using Zero-Order   Optimization Technique

Elissa Mhanna; Mohamad Assaad

arXiv:2409.16456·cs.LG·September 26, 2024

Communication and Energy Efficient Federated Learning using Zero-Order Optimization Technique

Elissa Mhanna, Mohamad Assaad

PDF

Open Access

TL;DR

This paper introduces a zero-order optimization approach for federated learning that significantly reduces communication and energy costs by transmitting only a single scalar per device per iteration, with proven convergence.

Contribution

It presents a novel zero-order federated learning method with theoretical convergence guarantees, addressing communication and energy efficiency challenges.

Findings

01

Reduces communication overhead compared to standard FL methods.

02

Proves convergence of the zero-order FL method in non-convex settings.

03

Demonstrates improved energy efficiency in practical scenarios.

Abstract

Federated learning (FL) is a popular machine learning technique that enables multiple users to collaboratively train a model while maintaining the user data privacy. A significant challenge in FL is the communication bottleneck in the upload direction, and thus the corresponding energy consumption of the devices, attributed to the increasing size of the model/gradient. In this paper, we address this issue by proposing a zero-order (ZO) optimization method that requires the upload of a quantized single scalar per iteration by each device instead of the whole gradient vector. We prove its theoretical convergence and find an upper bound on its convergence rate in the non-convex setting, and we discuss its implementation in practical scenarios. Our FL method and the corresponding convergence analysis take into account the impact of quantization and packet dropping due to wireless errors. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Stochastic Gradient Optimization Techniques · Neural Networks and Applications