Optimal algorithms for smooth and strongly convex distributed optimization in networks

Kevin Scaman (MSR - INRIA); Francis Bach (SIERRA); Sébastien Bubeck; Yin Tat Lee; Laurent Massoulié (MSR - INRIA)

arXiv:1702.08704·math.OC·April 10, 2017·ICML·189 cites

TL;DR

This paper establishes the optimal convergence rates for distributed optimization in networks, introducing a new optimal decentralized algorithm and analyzing centralized methods for smooth, strongly convex functions.

Contribution

It presents the first optimal decentralized algorithm (MSDA) and determines the optimal rates for both centralized and decentralized distributed optimization.

Findings

1. Distributed Nesterov's method is optimal for centralized settings.
2. MSDA achieves optimal convergence in decentralized gossip-based networks.
3. Empirical tests confirm MSDA's efficiency on regression and classification tasks.

Abstract

In this paper, we determine the optimal convergence rates for strongly convex and smooth distributed optimization in two settings: centralized and decentralized communications over a network. For centralized (i.e., master/slave) algorithms, we show that distributing Nesterov's accelerated gradient descent is optimal and achieves a precision $ε > 0$ in time $O(\sqrt{κ_{g}} (1 + Δτ) \ln(1/ε))$, where $κ_{g}$ is the condition number of the (global) function to optimize, $Δ$ is the diameter of the network, and $τ$ (resp. $1$) is the time needed to communicate values between two neighbors (resp. perform local computations). For decentralized algorithms based on gossip, we provide the first optimal algorithm, called the multi-step dual accelerated (MSDA) method, that achieves a precision $ε > 0$ in time…
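The centralized setting described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the toy quadratic local functions, the names `local_gradient`, `distributed_nesterov`, and `workers`, and the per-iteration cost of $1 + 2Δτ$ (one unit of local computation plus a broadcast and a gather across a network of diameter $Δ$) are all assumptions made for illustration. The master holds the iterate, each worker returns the gradient of its local function, and the master applies the standard constant-momentum form of Nesterov's accelerated gradient descent for strongly convex functions.

```python
import math


def local_gradient(a, b, y):
    """Gradient of the toy local function f_i(y) = 0.5 * sum_j a[j] * (y[j] - b[j])**2."""
    return [a_j * (y_j - b_j) for a_j, b_j, y_j in zip(a, b, y)]


def distributed_nesterov(workers, L, mu, num_iters, diameter, tau):
    """Simulate master/worker accelerated gradient descent on the average of the f_i.

    workers  : list of (a, b) pairs defining each local quadratic (hypothetical data)
    L, mu    : smoothness and strong convexity constants of the global function
    diameter : network diameter (Delta in the abstract)
    tau      : time to communicate between two neighbors (local computation costs 1)
    Returns the final iterate and the simulated elapsed time.
    """
    n = len(workers)
    dim = len(workers[0][0])
    kappa = L / mu  # global condition number kappa_g
    beta = (math.sqrt(kappa) - 1.0) / (math.sqrt(kappa) + 1.0)  # momentum coefficient
    x = [0.0] * dim
    y = [0.0] * dim
    elapsed = 0.0
    for _ in range(num_iters):
        # Each worker evaluates its local gradient at the broadcast point y.
        grads = [local_gradient(a, b, y) for a, b in workers]
        # The master averages the local gradients into the global gradient.
        g = [sum(gr[j] for gr in grads) / n for j in range(dim)]
        # Gradient step from y, followed by the momentum extrapolation.
        x_next = [y[j] - g[j] / L for j in range(dim)]
        y = [x_next[j] + beta * (x_next[j] - x[j]) for j in range(dim)]
        x = x_next
        # Assumed cost model: 1 for computation, 2 * Delta * tau for broadcast + gather.
        elapsed += 1.0 + 2.0 * diameter * tau
    return x, elapsed


# Three hypothetical workers, each with a diagonal quadratic in two dimensions.
workers = [
    ([1.0, 10.0], [1.0, -2.0]),
    ([3.0, 30.0], [0.0, 4.0]),
    ([2.0, 50.0], [5.0, 1.0]),
]
n, dim = len(workers), 2

# For diagonal quadratics the global curvature per coordinate is the mean of the a[j].
curvature = [sum(a[j] for a, _ in workers) / n for j in range(dim)]
L, mu = max(curvature), min(curvature)

# Closed-form minimizer of the average, used only to check the result.
x_star = [
    sum(a[j] * b[j] for a, b in workers) / sum(a[j] for a, _ in workers)
    for j in range(dim)
]

x_hat, elapsed = distributed_nesterov(workers, L, mu, num_iters=200, diameter=3, tau=0.5)
print("estimate:", x_hat)
print("minimizer:", x_star)
print("simulated time:", elapsed)
```

In this cost model the elapsed time grows linearly in the number of iterations, with each iteration weighted by the communication delay across the network, which is how the network diameter and $τ$ enter the total running time.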

Code & Models

Repositories

adelnabli/dadao (PyTorch)

Taxonomy

Topics: Distributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques