Robust Distributed Accelerated Stochastic Gradient Methods for   Multi-Agent Networks

Alireza Fallah; Mert Gurbuzbalaban; Asuman Ozdaglar; Umut Simsekli,; Lingjiong Zhu

arXiv:1910.08701·math.OC·October 5, 2021·24 cites

Robust Distributed Accelerated Stochastic Gradient Methods for Multi-Agent Networks

Alireza Fallah, Mert Gurbuzbalaban, Asuman Ozdaglar, Umut Simsekli,, Lingjiong Zhu

PDF

Open Access

TL;DR

This paper introduces distributed accelerated stochastic gradient methods for multi-agent networks, achieving optimal convergence rates and acceleration in decentralized convex optimization with noisy gradients.

Contribution

It develops a framework for tuning stepsize and momentum to optimize performance, proving acceleration with iteration complexity scaling with the square root of the condition number.

Findings

01

Achieves accelerated convergence with ( \, ( \, ( \, Optimal iteration complexity scales with ( \, Provides tight bounds for quadratic functions and a multistage version with linear bias decay.

02

The methods are robust to gradient noise and outperform existing algorithms in numerical experiments.

Abstract

We study distributed stochastic gradient (D-SG) method and its accelerated variant (D-ASG) for solving decentralized strongly convex stochastic optimization problems where the objective function is distributed over several computational units, lying on a fixed but arbitrary connected communication graph, subject to local communication constraints where noisy estimates of the gradients are available. We develop a framework which allows to choose the stepsize and the momentum parameters of these algorithms in a way to optimize performance by systematically trading off the bias, variance, robustness to gradient noise and dependence to network effects. When gradients do not contain noise, we also prove that distributed accelerated methods can \emph{achieve acceleration}, requiring $O (κ lo g (1/ ε))$ gradient evaluations and $O (κ lo g (1/ ε))$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques · Energy Efficient Wireless Sensor Networks