Sketching the Krylov Subspace: Faster Computation of the Entire Ridge Regularization Path
Yifei Wang, Mert Pilanci

TL;DR
This paper introduces a fast, nearly linear time algorithm for computing the entire ridge regression regularization path by sketching the Krylov subspace, enabling efficient model tuning and improved computational complexity.
Contribution
It presents a novel Krylov subspace sketching algorithm that accelerates ridge regression path computation with adaptive sketching dimension and broad applicability.
Findings
Achieves nearly linear time complexity for ridge path computation.
Outperforms standard methods on medium and large-scale tasks.
Works for both over-determined and under-determined problems.
Abstract
We propose a fast algorithm for computing the entire ridge regression regularization path in nearly linear time. Our method constructs a basis on which the solution of ridge regression can be computed instantly for any value of the regularization parameter. Consequently, linear models can be tuned via cross-validation or other risk estimation strategies with substantially better efficiency. The algorithm is based on iteratively sketching the Krylov subspace with a binomial decomposition over the regularization path. We provide a convergence analysis with various sketching matrices and show that it improves the state-of-the-art computational complexity. We also provide a technique to adaptively estimate the sketching dimension. This algorithm works for both the over-determined and under-determined problems. We also provide an extension for matrix-valued ridge regression. The numerical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Matrix Theory and Algorithms · Stochastic Gradient Optimization Techniques
