Robust Distributed Accelerated Stochastic Gradient Methods for Multi-Agent Networks
Alireza Fallah, Mert Gurbuzbalaban, Asuman Ozdaglar, Umut Simsekli,, Lingjiong Zhu

TL;DR
This paper introduces distributed accelerated stochastic gradient methods for multi-agent networks, achieving optimal convergence rates and acceleration in decentralized convex optimization with noisy gradients.
Contribution
It develops a framework for tuning stepsize and momentum to optimize performance, proving acceleration with iteration complexity scaling with the square root of the condition number.
Findings
Achieves accelerated convergence with ( \, ( \, ( \, Optimal iteration complexity scales with ( \, Provides tight bounds for quadratic functions and a multistage version with linear bias decay.
The methods are robust to gradient noise and outperform existing algorithms in numerical experiments.
Abstract
We study distributed stochastic gradient (D-SG) method and its accelerated variant (D-ASG) for solving decentralized strongly convex stochastic optimization problems where the objective function is distributed over several computational units, lying on a fixed but arbitrary connected communication graph, subject to local communication constraints where noisy estimates of the gradients are available. We develop a framework which allows to choose the stepsize and the momentum parameters of these algorithms in a way to optimize performance by systematically trading off the bias, variance, robustness to gradient noise and dependence to network effects. When gradients do not contain noise, we also prove that distributed accelerated methods can \emph{achieve acceleration}, requiring gradient evaluations and …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques · Energy Efficient Wireless Sensor Networks
