Asymptotics for Sketching in Least Squares Regression
Edgar Dobriban, Sifan Liu

TL;DR
This paper analyzes the asymptotic behavior of sketching methods in least squares regression, revealing fundamental differences between methods like SRHT and Gaussian projections, and providing theoretical limits on their accuracy loss.
Contribution
It provides a fine-grained asymptotic analysis distinguishing the performance of different sketching methods in least squares regression.
Findings
SRHT outperforms Gaussian projections in asymptotic accuracy.
Theoretical limits of estimation and test error are established.
Results are validated on real and synthetic datasets.
Abstract
We consider a least squares regression problem where the data has been generated from a linear model, and we are interested to learn the unknown regression parameters. We consider "sketch-and-solve" methods that randomly project the data first, and do regression after. Previous works have analyzed the statistical and computational performance of such methods. However, the existing analysis is not fine-grained enough to show the fundamental differences between various methods, such as the Subsampled Randomized Hadamard Transform (SRHT) and Gaussian projections. In this paper, we make progress on this problem, working in an asymptotic framework where the number of datapoints and dimension of features goes to infinity. We find the limits of the accuracy loss (for estimation and test error) incurred by popular sketching methods. We show separation between different methods, so that SRHT is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
