Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging
Shusen Wang, Alex Gittens, Michael W. Mahoney

TL;DR
This paper investigates the effects of classical and Hessian sketching methods on matrix ridge regression, revealing their impacts on optimization, statistical bias-variance trade-offs, and how model averaging can mitigate increased risks.
Contribution
It provides a comprehensive analysis of sketching impacts on MRR, including bounds on bias and variance, and demonstrates the effectiveness of model averaging in reducing risks.
Findings
Classical sketch recovers nearly optimal solutions for MRR.
Hessian sketch's approximation error depends on response mass and optimal objective.
Model averaging significantly reduces the risk gap between true and sketched MRR solutions.
Abstract
We address the statistical and optimization impacts of the classical sketch and Hessian sketch used to approximately solve the Matrix Ridge Regression (MRR) problem. Prior research has quantified the effects of classical sketch on the strictly simpler least squares regression (LSR) problem. We establish that classical sketch has a similar effect upon the optimization properties of MRR as it does on those of LSR: namely, it recovers nearly optimal solutions. By contrast, Hessian sketch does not have this guarantee, instead, the approximation error is governed by a subtle interplay between the "mass" in the responses and the optimal objective value. For both types of approximation, the regularization in the sketched MRR problem results in significantly different statistical properties from those of the sketched LSR problem. In particular, there is a bias-variance trade-off in sketched…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Multi-Objective Optimization Algorithms · Gaussian Processes and Bayesian Inference
