Communication-Efficient Distributed Learning via Lazily Aggregated   Quantized Gradients

Jun Sun; Tianyi Chen; Georgios B. Giannakis; and Zaiyue Yang

arXiv:1909.07588·cs.LG·September 18, 2019·39 cites

Communication-Efficient Distributed Learning via Lazily Aggregated Quantized Gradients

Jun Sun, Tianyi Chen, Georgios B. Giannakis, and Zaiyue Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces LAQ, a communication-efficient distributed learning method that combines gradient quantization and selective communication skipping, achieving convergence comparable to traditional methods with reduced communication costs.

Contribution

The paper proposes LAQ, a novel approach that adaptively compresses and selectively skips gradient communications, significantly reducing communication overhead while maintaining convergence rates.

Findings

01

LAQ achieves the same linear convergence rate as gradient descent in strongly convex settings.

02

LAQ reduces communication bits and rounds compared to existing algorithms.

03

Empirical results confirm significant communication savings with real data.

Abstract

The present paper develops a novel aggregated gradient approach for distributed machine learning that adaptively compresses the gradient communication. The key idea is to first quantize the computed gradients, and then skip less informative quantized gradient communications by reusing outdated gradients. Quantizing and skipping result in `lazy' worker-server communications, which justifies the term Lazily Aggregated Quantized gradient that is henceforth abbreviated as LAQ. Our LAQ can provably attain the same linear convergence rate as the gradient descent in the strongly convex case, while effecting major savings in the communication overhead both in transmitted bits as well as in communication rounds. Empirically, experiments with real data corroborate a significant communication reduction compared to existing gradient- and stochastic gradient-based algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sunjunaimer/LAQ
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Privacy-Preserving Technologies in Data