Optimal Sketching Bounds for Sparse Linear Regression
Tung Mai, Alexander Munteanu, Cameron Musco, Anup B. Rao, Chris, Schwiegelshohn, David P. Woodruff

TL;DR
This paper establishes tight bounds on oblivious sketching dimensions for sparse linear regression across various loss functions, revealing fundamental differences from sparse recovery and extending to LASSO regression.
Contribution
It provides the first known sketching bounds for hinge-like loss functions and LASSO, demonstrating tight bounds and separations from related problems.
Findings
Sketching bounds for sparse $oldsymbol{ ext{ell}_p}$ regression are tight up to constants.
Sparse recovery is shown to be easier to sketch than sparse regression.
Sketching bounds for LASSO regression are tight and depend optimally on parameters.
Abstract
We study oblivious sketching for -sparse linear regression under various loss functions such as an norm, or from a broad class of hinge-like loss functions, which includes the logistic and ReLU losses. We show that for sparse norm regression, there is a distribution over oblivious sketches with rows, which is tight up to a constant factor. This extends to loss with an additional additive term in the upper bound. This establishes a surprising separation from the related sparse recovery problem, which is an important special case of sparse regression. For this problem, under the norm, we observe an upper bound of rows, showing that sparse recovery is strictly easier to sketch than sparse regression. For sparse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques
MethodsLinear Regression
