Optimal algorithms for smooth and strongly convex distributed optimization in networks
Kevin Scaman (MSR - INRIA), Francis Bach (SIERRA), S\'ebastien Bubeck,, Yin Tat Lee, Laurent Massouli\'e (MSR - INRIA)

TL;DR
This paper establishes the optimal convergence rates for distributed optimization in networks, introducing a new optimal decentralized algorithm and analyzing centralized methods for smooth, strongly convex functions.
Contribution
It presents the first optimal decentralized algorithm (MSDA) and determines the optimal rates for both centralized and decentralized distributed optimization.
Findings
Distributed Nesterov's method is optimal for centralized settings.
MSDA achieves optimal convergence in decentralized gossip-based networks.
Empirical tests confirm MSDA's efficiency on regression and classification tasks.
Abstract
In this paper, we determine the optimal convergence rates for strongly convex and smooth distributed optimization in two settings: centralized and decentralized communications over a network. For centralized (i.e. master/slave) algorithms, we show that distributing Nesterov's accelerated gradient descent is optimal and achieves a precision in time , where is the condition number of the (global) function to optimize, is the diameter of the network, and (resp. ) is the time needed to communicate values between two neighbors (resp. perform local computations). For decentralized algorithms based on gossip, we provide the first optimal algorithm, called the multi-step dual accelerated (MSDA) method, that achieves a precision in time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques
