Avoiding Synchronization in First-Order Methods for Sparse Convex   Optimization

Aditya Devarakonda; Kimon Fountoulakis; James Demmel; Michael W.; Mahoney

arXiv:1712.06047·cs.DC·December 19, 2017

Avoiding Synchronization in First-Order Methods for Sparse Convex Optimization

Aditya Devarakonda, Kimon Fountoulakis, James Demmel, Michael W., Mahoney

PDF

TL;DR

This paper introduces synchronization-avoiding techniques for first-order methods in sparse convex optimization, significantly reducing communication costs and achieving notable speedups in parallel computing environments.

Contribution

It extends communication-avoiding Krylov methods to first-order algorithms, enabling scalable parallel optimization with reduced synchronization overhead.

Findings

01

Achieved up to 5.1x speedup on supercomputers.

02

Reduced communication latency by a tunable factor s.

03

Maintained numerical stability of the methods.

Abstract

Parallel computing has played an important role in speeding up convex optimization methods for big data analytics and large-scale machine learning (ML). However, the scalability of these optimization methods is inhibited by the cost of communicating and synchronizing processors in a parallel setting. Iterative ML methods are particularly sensitive to communication cost since they often require communication every iteration. In this work, we extend well-known techniques from Communication-Avoiding Krylov subspace methods to first-order, block coordinate descent methods for Support Vector Machines and Proximal Least-Squares problems. Our Synchronization-Avoiding (SA) variants reduce the latency cost by a tunable factor of $s$ at the expense of a factor of $s$ increase in flops and bandwidth costs. We show that the SA-variants are numerically stable and can attain large speedups of up to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.