FedDA: Faster Framework of Local Adaptive Gradient Methods via Restarted   Dual Averaging

Junyi Li; Feihu Huang; Heng Huang

arXiv:2302.06103·cs.LG·February 14, 2023

FedDA: Faster Framework of Local Adaptive Gradient Methods via Restarted Dual Averaging

Junyi Li, Feihu Huang, Heng Huang

PDF

Open Access

TL;DR

FedDA introduces a flexible framework for adaptive gradient methods in federated learning, utilizing restarted dual averaging to improve convergence rates and outperform existing algorithms.

Contribution

The paper proposes FedDA, a novel framework that incorporates adaptive gradient methods into federated learning with theoretical convergence guarantees.

Findings

01

FedDA-MVR achieves optimal gradient and communication complexity.

02

FedDA matches the best known rates for first-order federated learning algorithms.

03

Numerical experiments confirm the effectiveness of FedDA.

Abstract

Federated learning (FL) is an emerging learning paradigm to tackle massively distributed data. In Federated Learning, a set of clients jointly perform a machine learning task under the coordination of a server. The FedAvg algorithm is one of the most widely used methods to solve Federated Learning problems. In FedAvg, the learning rate is a constant rather than changing adaptively. The adaptive gradient methods show superior performance over the constant learning rate schedule; however, there is still no general framework to incorporate adaptive gradient methods into the federated setting. In this paper, we propose \textbf{FedDA}, a novel framework for local adaptive gradient methods. The framework adopts a restarted dual averaging technique and is flexible with various gradient estimation methods and adaptive learning rate formulations. In particular, we analyze \textbf{FedDA-MVR}, an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Machine Learning and ELM