Efficient Randomized Subspace Embeddings for Distributed Optimization under a Communication Budget
Rajarshi Saha, Mert Pilanci, Andrea J. Goldsmith

TL;DR
This paper introduces efficient randomized subspace embedding techniques for distributed optimization under strict communication budgets, achieving optimal convergence rates and covering efficiency across various convexity and smoothness settings.
Contribution
It proposes a polynomial complexity source coding scheme using random subspace embeddings that are optimal for any bit-budget, including sub-linear and high regimes, with practical near-linear time variants.
Findings
Achieves convergence rates matching information-theoretic lower bounds.
Provides a source coding scheme with optimal covering efficiency.
Enhances gradient sparsification schemes with the proposed embeddings.
Abstract
We study first-order optimization algorithms under the constraint that the descent direction is quantized using a pre-specified budget of -bits per dimension, where . We propose computationally efficient optimization algorithms with convergence rates matching the information-theoretic performance lower bounds for: (i) Smooth and Strongly-Convex objectives with access to an Exact Gradient oracle, as well as (ii) General Convex and Non-Smooth objectives with access to a Noisy Subgradient oracle. The crux of these algorithms is a polynomial complexity source coding scheme that embeds a vector into a random subspace before quantizing it. These embeddings are such that with high probability, their projection along any of the canonical directions of the transform space is small. As a consequence, quantizing these embeddings followed by an inverse transform to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Face and Expression Recognition
