Supervised Kernel Thinning
Albert Gong, Kyuseong Choi, Raaz Dwivedi

TL;DR
This paper extends the kernel thinning algorithm to supervised learning, enabling faster training and inference for kernel methods like Nadaraya-Watson and kernel ridge regression, while maintaining statistical accuracy.
Contribution
It introduces Kernel-Thinned estimators for supervised learning, combining kernel thinning with classical methods to achieve quadratic speed-up and better statistical efficiency.
Findings
KT-based regressors are computationally more efficient than full-data estimators.
KT reduces training and inference times significantly.
KT provides improved statistical efficiency over i.i.d. subsampling.
Abstract
The kernel thinning algorithm of Dwivedi & Mackey (2024) provides a better-than-i.i.d. compression of a generic set of points. By generating high-fidelity coresets of size significantly smaller than the input points, KT is known to speed up unsupervised tasks like Monte Carlo integration, uncertainty quantification, and non-parametric hypothesis testing, with minimal loss in statistical accuracy. In this work, we generalize the KT algorithm to speed up supervised learning problems involving kernel methods. Specifically, we combine two classical algorithms--Nadaraya-Watson (NW) regression or kernel smoothing, and kernel ridge regression (KRR)--with KT to provide a quadratic speed-up in both training and inference times. We show how distribution compression with KT in each setting reduces to constructing an appropriate kernel, and introduce the Kernel-Thinned NW and Kernel-Thinned KRR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Image and Signal Denoising Methods · Advanced Vision and Imaging
MethodsCoresets · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sparse Evolutionary Training
