Pack only the essentials: Adaptive dictionary learning for kernel ridge regression

Daniele Calandriello; Alessandro Lazaric; Michal Valko

arXiv:2604.22386·stat.ML·April 27, 2026·2 cites

Pack only the essentials: Adaptive dictionary learning for kernel ridge regression

Daniele Calandriello, Alessandro Lazaric, Michal Valko

PDF

1 Datasets

TL;DR

This paper introduces SQUEAK, an improved algorithm for kernel ridge regression that reduces space complexity by efficiently approximating ridge leverage scores without the need for normalization or estimating effective dimension.

Contribution

SQUEAK simplifies and enhances INK-Estimate by using unnormalized RLS, achieving near-optimal space complexity for large-scale kernel ridge regression.

Findings

01

SQUEAK achieves space complexity close to exact RLS sampling.

02

It simplifies the process by removing the need for effective dimension estimation.

03

The algorithm maintains accuracy with fewer columns in Nystrom approximations.

Abstract

One of the major limits of kernel ridge regression (KRR) is that storing and manipulating the kernel matrix K_n for n samples requires O(n^2) space, which rapidly becomes unfeasible for large n. Nystrom approximations reduce the space complexity to O(nm) by sampling m columns from K_n. Uniform sampling preserves KRR accuracy (up to epsilon) only when m is proportional to the maximum degree of freedom of K_n, which may require O(n) columns for datasets with high coherence. Sampling columns according to their ridge leverage scores (RLS) gives accurate Nystrom approximations with m proportional to the effective dimension, but computing exact RLS also requires O(n^2) space. (Calandriello et al. 2016) propose INK-Estimate, an algorithm that processes the dataset incrementally and updates RLS, effective dimension, and Nystrom approximations on-the-fly. Its space complexity scales with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

misovalko/my-research-papers
dataset· 103 dl
103 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.