Doubly Random Parallel Stochastic Methods for Large Scale Learning
Aryan Mokhtari, Alec Koppel, Alejandro Ribeiro

TL;DR
This paper introduces RAPSA, a novel parallel stochastic algorithm that operates on randomly selected data blocks and training examples, demonstrating convergence in large-scale learning tasks.
Contribution
RAPSA is the first method combining parallelism in both data and feature block selection, with proven convergence for convex objectives.
Findings
Converges almost surely with decreasing stepsizes.
Converges to a neighborhood with constant stepsizes.
Shown to be effective on MNIST digit recognition.
Abstract
We consider learning problems over training sets in which both, the number of training examples and the dimension of the feature vectors, are large. To solve these problems we propose the random parallel stochastic algorithm (RAPSA). We call the algorithm random parallel because it utilizes multiple processors to operate in a randomly chosen subset of blocks of the feature vector. We call the algorithm parallel stochastic because processors choose elements of the training set randomly and independently. Algorithms that are parallel in either of these dimensions exist, but RAPSA is the first attempt at a methodology that is parallel in both, the selection of blocks and the selection of elements of the training set. In RAPSA, processors utilize the randomly chosen functions to compute the stochastic gradient component associated with a randomly chosen block. The technical contribution of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
