Parallel Stochastic Asynchronous Coordinate Descent: Tight Bounds on the   Possible Parallelism

Yun Kuen Cheung; Richard Cole; Yixin Tao

arXiv:1811.05087·math.OC·November 23, 2020·1 cites

Parallel Stochastic Asynchronous Coordinate Descent: Tight Bounds on the Possible Parallelism

Yun Kuen Cheung, Richard Cole, Yixin Tao

PDF

Open Access

TL;DR

This paper establishes tight bounds on the maximum parallelism achievable in asynchronous stochastic coordinate descent algorithms, confirming the theoretical limits of linear speedup in parallel optimization.

Contribution

It proves that the known bounds on parallelism are tight for nearly all parameter values, providing a precise characterization of the algorithm's scalability.

Findings

01

The established bounds are tight for almost all parameter values.

02

Linear speedup is limited by the ratio of Lipschitz parameters.

03

The results confirm the optimality of existing parallel coordinate descent methods.

Abstract

Several works have shown linear speedup is achieved by an asynchronous parallel implementation of stochastic coordinate descent so long as there is not too much parallelism. More specifically, it is known that if all updates are of similar duration, then linear speedup is possible with up to $Θ (n L_{m a x} / L_{\overline{res}})$ processors, where $L_{m a x}$ and $L_{\overline{res}}$ are suitable Lipschitz parameters. This paper shows the bound is tight for almost all possible values of these parameters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Quantum Computing Algorithms and Architecture · Error Correcting Code Techniques