Supervised Kernel Thinning

Albert Gong; Kyuseong Choi; Raaz Dwivedi

arXiv:2410.13749·cs.LG·January 16, 2025

Supervised Kernel Thinning

Albert Gong, Kyuseong Choi, Raaz Dwivedi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper extends the kernel thinning algorithm to supervised learning, enabling faster training and inference for kernel methods like Nadaraya-Watson and kernel ridge regression, while maintaining statistical accuracy.

Contribution

It introduces Kernel-Thinned estimators for supervised learning, combining kernel thinning with classical methods to achieve quadratic speed-up and better statistical efficiency.

Findings

01

KT-based regressors are computationally more efficient than full-data estimators.

02

KT reduces training and inference times significantly.

03

KT provides improved statistical efficiency over i.i.d. subsampling.

Abstract

The kernel thinning algorithm of Dwivedi & Mackey (2024) provides a better-than-i.i.d. compression of a generic set of points. By generating high-fidelity coresets of size significantly smaller than the input points, KT is known to speed up unsupervised tasks like Monte Carlo integration, uncertainty quantification, and non-parametric hypothesis testing, with minimal loss in statistical accuracy. In this work, we generalize the KT algorithm to speed up supervised learning problems involving kernel methods. Specifically, we combine two classical algorithms--Nadaraya-Watson (NW) regression or kernel smoothing, and kernel ridge regression (KRR)--with KT to provide a quadratic speed-up in both training and inference times. We show how distribution compression with KT in each setting reduces to constructing an appropriate kernel, and introduce the Kernel-Thinned NW and Kernel-Thinned KRR…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ag2435/npr
pytorchOfficial

Videos

Supervised Kernel Thinning· slideslive

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection · Image and Signal Denoising Methods · Advanced Vision and Imaging

MethodsCoresets · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sparse Evolutionary Training