Optimal Sketching Bounds for Sparse Linear Regression

Tung Mai; Alexander Munteanu; Cameron Musco; Anup B. Rao; Chris; Schwiegelshohn; David P. Woodruff

arXiv:2304.02261·cs.DS·April 6, 2023·1 cites

Optimal Sketching Bounds for Sparse Linear Regression

Tung Mai, Alexander Munteanu, Cameron Musco, Anup B. Rao, Chris, Schwiegelshohn, David P. Woodruff

PDF

Open Access

TL;DR

This paper establishes tight bounds on oblivious sketching dimensions for sparse linear regression across various loss functions, revealing fundamental differences from sparse recovery and extending to LASSO regression.

Contribution

It provides the first known sketching bounds for hinge-like loss functions and LASSO, demonstrating tight bounds and separations from related problems.

Findings

01

Sketching bounds for sparse $oldsymbol{ ext{ell}_p}$ regression are tight up to constants.

02

Sparse recovery is shown to be easier to sketch than sparse regression.

03

Sketching bounds for LASSO regression are tight and depend optimally on parameters.

Abstract

We study oblivious sketching for $k$ -sparse linear regression under various loss functions such as an $ℓ_{p}$ norm, or from a broad class of hinge-like loss functions, which includes the logistic and ReLU losses. We show that for sparse $ℓ_{2}$ norm regression, there is a distribution over oblivious sketches with $Θ (k lo g (d / k) / ε^{2})$ rows, which is tight up to a constant factor. This extends to $ℓ_{p}$ loss with an additional additive $O (k lo g (k / ε) / ε^{2})$ term in the upper bound. This establishes a surprising separation from the related sparse recovery problem, which is an important special case of sparse regression. For this problem, under the $ℓ_{2}$ norm, we observe an upper bound of $O (k lo g (d) / ε + k lo g (k / ε) / ε^{2})$ rows, showing that sparse recovery is strictly easier to sketch than sparse regression. For sparse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques

MethodsLinear Regression