Fast Distributed Coordinate Descent for Non-Strongly Convex Losses

Olivier Fercoq; Zheng Qu; Peter Richt\'arik; Martin; Tak\'a\v{c}

arXiv:1405.5300·math.OC·July 29, 2014

Fast Distributed Coordinate Descent for Non-Strongly Convex Losses

Olivier Fercoq, Zheng Qu, Peter Richt\'arik, Martin, Tak\'a\v{c}

PDF

TL;DR

This paper introduces a fast distributed coordinate descent algorithm for large-scale non-strongly convex optimization, achieving optimal convergence rates and demonstrating scalability on supercomputers for problems with billions of variables.

Contribution

It presents a novel distributed randomized coordinate descent method with proven optimal convergence for non-strongly convex functions, and demonstrates its scalability on supercomputers.

Findings

01

Achieves $O(1/k^2)$ convergence rate for non-strongly convex losses.

02

Successfully solves a synthetic LASSO problem with 50 billion variables.

03

Implemented on the UK's largest supercomputer, showing high scalability.

Abstract

We propose an efficient distributed randomized coordinate descent method for minimizing regularized non-strongly convex loss functions. The method attains the optimal $O (1/ k^{2})$ convergence rate, where $k$ is the iteration counter. The core of the work is the theoretical study of stepsize parameters. We have implemented the method on Archer - the largest supercomputer in the UK - and show that the method is capable of solving a (synthetic) LASSO optimization problem with 50 billion variables.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.