Asynchronous SGD on Graphs: a Unified Framework for Asynchronous Decentralized and Federated Optimization
Mathieu Even, Anastasia Koloskova, Laurent Massouli\'e

TL;DR
This paper introduces AGRAF SGD, a unified asynchronous decentralized optimization framework that combines asynchronous communication and decentralization, improving convergence rates under milder assumptions for various algorithms.
Contribution
The paper presents AGRAF SGD, a general framework unifying asynchronous decentralized and federated optimization algorithms with improved convergence guarantees.
Findings
Provides convergence rates under milder assumptions.
Recovers or improves upon best known results for multiple algorithms.
Unifies various asynchronous decentralized algorithms under one framework.
Abstract
Decentralized and asynchronous communications are two popular techniques to speedup communication complexity of distributed machine learning, by respectively removing the dependency over a central orchestrator and the need for synchronization. Yet, combining these two techniques together still remains a challenge. In this paper, we take a step in this direction and introduce Asynchronous SGD on Graphs (AGRAF SGD) -- a general algorithmic framework that covers asynchronous versions of many popular algorithms including SGD, Decentralized SGD, Local SGD, FedBuff, thanks to its relaxed communication and computation assumptions. We provide rates of convergence under much milder assumptions than previous decentralized asynchronous works, while still recovering or even improving over the best know results for all the algorithms covered.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Ferroelectric and Negative Capacitance Devices · Optimization and Search Problems
MethodsStochastic Gradient Descent · Local SGD
