Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Kevin Scaman, Francis Bach, Sébastien Bubeck, Yin Tat Lee, and Laurent Massoulié

arXiv:1806.00291 · math.OC · June 4, 2018 · NeurIPS · 79 citations

TL;DR

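This paper introduces optimal and near-optimal algorithms for distributed non-smooth convex optimization under two regularity assumptions: Lipschitz continuity of the local functions, or of the global objective. Under local regularity, the communication network affects only a second-order term of the error.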

Contribution

The paper presents the first optimal first-order decentralized algorithm, multi-step primal-dual (MSPD), for the local regularity setting, and a new distributed smoothing method (DRS) for the global regularity setting, with proven optimal and near-optimal convergence rates respectively.

Findings

1. MSPD achieves the optimal convergence rate, with the communication network affecting only a second-order term of the error.

2. DRS is within a $d^{1/4}$ multiplicative factor of the optimal rate under global regularity, where $d$ is the underlying dimension.

3. The error due to limited communication decreases quickly, even for non-strongly-convex objective functions.

Abstract

In this work, we consider the distributed optimization of non-smooth convex functions using a network of computing units. We investigate this problem under two regularity assumptions: (1) the Lipschitz continuity of the global objective function, and (2) the Lipschitz continuity of local individual functions. Under the local regularity assumption, we provide the first optimal first-order decentralized algorithm called multi-step primal-dual (MSPD) and its corresponding optimal convergence rate. A notable aspect of this result is that, for non-smooth functions, while the dominant term of the error is in $O(1/\sqrt{t})$, the structure of the communication network only impacts a second-order term in $O(1/t)$, where $t$ is time. In other words, the error due to limits in communication resources decreases at a fast rate even in the case of non-strongly-convex objective functions. Under the…
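
To make the "second-order term" remark concrete, here is a minimal numerical sketch. It does not reproduce the paper's bound: it assumes a schematic error of the form $a/\sqrt{t} + b/t$, where the constants `a` (optimization term) and `b` (communication term) are hypothetical and chosen only for illustration. Under that assumption, the share of the error attributable to the $b/t$ term is $b/(a\sqrt{t} + b)$, which vanishes as $t$ grows.

```python
import math


def error_bound(t, a=1.0, b=10.0):
    """Schematic bound a/sqrt(t) + b/t.

    a and b are hypothetical constants, not values from the paper.
    """
    return a / math.sqrt(t) + b / t


def communication_share(t, a=1.0, b=10.0):
    """Fraction of the schematic bound contributed by the b/t term."""
    return (b / t) / error_bound(t, a, b)


if __name__ == "__main__":
    # Print t, the schematic bound, and the share of the b/t term.
    for t in (10, 1_000, 100_000):
        print(t, round(error_bound(t), 5), round(communication_share(t), 3))
```

Even with `b` ten times larger than `a`, the $b/t$ term ends up as a small fraction of the total for large `t`, which is the sense in which the network only affects a lower-order part of the error.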

Taxonomy

Topics: Distributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques