Towards Tight Communication Lower Bounds for Distributed Optimisation
Dan Alistarh, Janne H. Korhonen

TL;DR
This paper establishes fundamental lower bounds on the communication required for distributed optimization, showing that a certain amount of bits must be exchanged to achieve a specified accuracy, and introduces a matching algorithm for quadratic objectives.
Contribution
It provides the first unconditional communication lower bounds for distributed optimization, applicable to both deterministic and randomized algorithms without structural assumptions.
Findings
Total communication lower bound of Nd g d / N bits for psilon-approximate solutions.
The bounds are tight for quadratic objectives, with a new quantized gradient descent algorithm matching the lower bounds within constant factors.
Abstract
We consider a standard distributed optimisation setting where machines, each holding a -dimensional function , aim to jointly minimise the sum of the functions . This problem arises naturally in large-scale distributed optimisation, where a standard solution is to apply variants of (stochastic) gradient descent. We focus on the communication complexity of this problem: our main result provides the first fully unconditional bounds on total number of bits which need to be sent and received by the machines to solve this problem under point-to-point communication, within a given error-tolerance. Specifically, we show that total bits need to be communicated between the machines to find an additive -approximation to the minimum of . The result holds for both deterministic and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsComplexity and Algorithms in Graphs · Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data
