Faster Kernel Ridge Regression Using Sketching and Preconditioning
Haim Avron, Kenneth L. Clarkson, David P. Woodruff

TL;DR
This paper introduces a preconditioning method using random feature maps to accelerate Kernel Ridge Regression, making it scalable and efficient for large datasets with up to one million examples.
Contribution
The paper presents a novel preconditioning technique based on random feature maps that significantly speeds up solving KRR linear systems for large-scale data.
Findings
Effective preconditioning reduces computation time.
Method scales to datasets with up to one million examples.
Random features improve convergence with fewer features.
Abstract
Kernel Ridge Regression (KRR) is a simple yet powerful technique for non-parametric regression whose computation amounts to solving a linear system. This system is usually dense and highly ill-conditioned. In addition, the dimensions of the matrix are the same as the number of data points, so direct methods are unrealistic for large-scale datasets. In this paper, we propose a preconditioning technique for accelerating the solution of the aforementioned linear system. The preconditioner is based on random feature maps, such as random Fourier features, which have recently emerged as a powerful technique for speeding up and scaling the training of kernel-based methods, such as kernel ridge regression, by resorting to approximations. However, random feature maps only provide crude approximations to the kernel function, so delivering state-of-the-art results by directly solving the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference
