Towards Tight Communication Lower Bounds for Distributed Optimisation

Dan Alistarh; Janne H. Korhonen

arXiv:2010.08222·cs.LG·December 8, 2021·1 cites

TL;DR

This paper establishes fundamental lower bounds on the communication required for distributed optimization: $N$ machines must exchange $\Omega(N d \log d / N \varepsilon)$ total bits to reach an additive $\varepsilon$-approximation of the minimum. It also introduces a matching algorithm for quadratic objectives.

Contribution

It provides the first unconditional communication lower bounds for distributed optimization, applicable to both deterministic and randomized algorithms without structural assumptions.

Findings

01

Total communication lower bound of $\Omega(N d \log d / N \varepsilon)$ bits for $\varepsilon$-approximate solutions.

02

The bounds are tight for quadratic objectives, with a new quantized gradient descent algorithm matching the lower bounds within constant factors.
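
To make the idea of quantized gradient descent on quadratics concrete, here is a minimal sketch. It is not the paper's algorithm: the diagonal quadratics $f_i(x) = \frac{1}{2}\sum_j a_{ij}(x_j - c_{ij})^2$, the fixed rounding grid, the coordinator model, and all function names are assumptions chosen for illustration. Only machine-to-coordinator bits are counted, and the bit accounting (sign bit plus magnitude bits, with no message delimiters) is deliberately crude.

```python
def quantize(value, step):
    """Return the integer index of the grid point nearest to value (grid spacing = step)."""
    return round(value / step)


def exact_minimiser(curv, centers):
    """Closed-form minimiser of sum_i 0.5 * sum_j curv[i][j] * (x_j - centers[i][j])**2."""
    n, d = len(curv), len(curv[0])
    return [
        sum(curv[i][j] * centers[i][j] for i in range(n))
        / sum(curv[i][j] for i in range(n))
        for j in range(d)
    ]


def quantized_gd(curv, centers, step, iters):
    """Gradient descent where each machine sends a rounded gradient to a coordinator.

    Hypothetical setup: machine i holds curv[i] and centers[i]; every machine
    is assumed to know the current iterate x. Returns (x, uplink_bits).
    """
    n, d = len(curv), len(curv[0])
    # Smoothness constant of the summed objective (largest total curvature).
    smoothness = max(sum(curv[i][j] for i in range(n)) for j in range(d))
    lr = 1.0 / smoothness
    x = [0.0] * d
    uplink_bits = 0
    for _ in range(iters):
        aggregate = [0.0] * d
        for i in range(n):
            for j in range(d):
                grad = curv[i][j] * (x[j] - centers[i][j])
                index = quantize(grad, step)
                # Crude cost of one message: one sign bit plus the magnitude bits.
                uplink_bits += abs(index).bit_length() + 1
                aggregate[j] += index * step
        x = [x[j] - lr * aggregate[j] for j in range(d)]
    return x, uplink_bits


if __name__ == "__main__":
    curv = [[1.0, 2.0], [3.0, 1.0]]
    centers = [[1.0, -2.0], [2.0, 4.0]]
    x, bits = quantized_gd(curv, centers, step=0.01, iters=200)
    print("approximate minimiser:", x)
    print("exact minimiser:", exact_minimiser(curv, centers))
    print("uplink bits counted:", bits)
```

With a fixed grid, the iterate settles within a distance of the optimum that is proportional to `step`, so a finer grid buys accuracy at the cost of more bits per message. That accuracy-versus-bits trade-off is what the lower bound constrains.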

Abstract

We consider a standard distributed optimisation setting where $N$ machines, each holding a $d$-dimensional function $f_{i}$, aim to jointly minimise the sum of the functions $\sum_{i = 1}^{N} f_{i}(x)$. This problem arises naturally in large-scale distributed optimisation, where a standard solution is to apply variants of (stochastic) gradient descent. We focus on the communication complexity of this problem: our main result provides the first fully unconditional bounds on the total number of bits which need to be sent and received by the $N$ machines to solve this problem under point-to-point communication, within a given error-tolerance. Specifically, we show that $\Omega(N d \log d / N \varepsilon)$ total bits need to be communicated between the machines to find an additive $\varepsilon$-approximation to the minimum of $\sum_{i = 1}^{N} f_{i}(x)$. The result holds for both deterministic and…
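
Read literally, the guarantee asks for a point whose objective value is within $\varepsilon$ of the optimum. A sketch of that condition follows. The output name $\hat{x}$ is notation assumed here, and the domain of the minimum is left unspecified because the abstract does not state it:

$$
\sum_{i = 1}^{N} f_{i}(\hat{x}) \;\le\; \min_{x} \sum_{i = 1}^{N} f_{i}(x) + \varepsilon .
$$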

Videos

Towards Tight Communication Lower Bounds for Distributed Optimisation · SlidesLive

Taxonomy

Topics: Complexity and Algorithms in Graphs · Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data