Parallel Coordinate Descent Methods for Big Data Optimization
Peter Richt\'arik, Martin Tak\'a\v{c}

TL;DR
This paper demonstrates that parallel coordinate descent methods can significantly accelerate big data optimization problems, especially when the problem structure allows for high degrees of separability, achieving near-linear speedup with multiple processors.
Contribution
It introduces a theoretical framework for parallelizing coordinate descent methods, quantifies speedup based on problem separability and processor count, and validates the approach with large-scale LASSO problem results.
Findings
Speedup depends on problem separability and number of processors.
Parallel coordinate descent can solve large-scale LASSO efficiently.
The method is robust to random block updates and processor unreliability.
Abstract
In this work we show that randomized (block) coordinate descent methods can be accelerated by parallelization when applied to the problem of minimizing the sum of a partially separable smooth convex function and a simple separable convex function. The theoretical speedup, as compared to the serial method, and referring to the number of iterations needed to approximately solve the problem with high probability, is a simple expression depending on the number of parallel processors and a natural and easily computable measure of separability of the smooth component of the objective function. In the worst case, when no degree of separability is present, there may be no speedup; in the best case, when the problem is separable, the speedup is equal to the number of processors. Our analysis also works in the mode when the number of blocks being updated at each iteration is random, which allows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Parallel Coordinate Descent Methods for Big Data Optimization· youtube
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Complexity and Algorithms in Graphs
