Lower Bounds and Nearly Optimal Algorithms in Distributed Learning with   Communication Compression

Xinmeng Huang; Yiming Chen; Wotao Yin; Kun Yuan

arXiv:2206.03665·cs.LG·October 12, 2022·1 cites

Lower Bounds and Nearly Optimal Algorithms in Distributed Learning with Communication Compression

Xinmeng Huang, Yiming Chen, Wotao Yin, Kun Yuan

PDF

Open Access 1 Video

TL;DR

This paper establishes a theoretical lower bound for convergence in distributed learning with communication compression and introduces NEOLITHIC, an algorithm that nearly attains this bound, advancing understanding of communication-efficient distributed algorithms.

Contribution

The paper provides the first lower bound for convergence rates under communication compression and proposes NEOLITHIC, an algorithm that nearly matches this bound in distributed non-convex optimization.

Findings

01

Lower bound for convergence rates established for compressed communication algorithms.

02

NEOLITHIC algorithm nearly attains the theoretical lower bound, up to logarithmic factors.

03

Bidirectional contractive compression can achieve convergence speeds comparable to unidirectional unbiased compression.

Abstract

Recent advances in distributed optimization and learning have shown that communication compression is one of the most effective means of reducing communication. While there have been many results on convergence rates under communication compression, a theoretical lower bound is still missing. Analyses of algorithms with communication compression have attributed convergence to two abstract properties: the unbiased property or the contractive property. They can be applied with either unidirectional compression (only messages from workers to server are compressed) or bidirectional compression. In this paper, we consider distributed stochastic algorithms for minimizing smooth and non-convex objective functions under communication compression. We establish a convergence lower bound for algorithms whether using unbiased or contractive compressors in unidirection or bidirection. To close the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Lower Bounds and Nearly Optimal Algorithms in Distributed Learning with Communication Compression· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Distributed Sensor Networks and Detection Algorithms · Wireless Communication Security Techniques