Fast Randomized Kernel Methods With Statistical Guarantees
Ahmed El Alaoui, Michael W. Mahoney

TL;DR
This paper introduces a fast, statistically guaranteed randomized kernel method that uses importance sampling based on extended leverage scores, reducing computational complexity while maintaining statistical accuracy.
Contribution
It extends statistical leverage scores to kernel ridge regression, providing a fast algorithm to approximate these scores and improve kernel method efficiency.
Findings
Algorithm runs in O(np^2) time, scalable to large datasets.
Sampling based on leverage scores reduces sketch size to effective dimensionality.
Empirical results confirm theoretical guarantees and effectiveness.
Abstract
One approach to improving the running time of kernel-based machine learning methods is to build a small sketch of the input and use it in lieu of the full kernel matrix in the machine learning task of interest. Here, we describe a version of this approach that comes with running time guarantees as well as improved guarantees on its statistical performance. By extending the notion of \emph{statistical leverage scores} to the setting of kernel ridge regression, our main statistical result is to identify an importance sampling distribution that reduces the size of the sketch (i.e., the required number of columns to be sampled) to the \emph{effective dimensionality} of the problem. This quantity is often much smaller than previous bounds that depend on the \emph{maximal degrees of freedom}. Our main algorithmic result is to present a fast algorithm to compute approximations to these scores.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Gaussian Processes and Bayesian Inference
