On the Intrinsic Dimensions of Data in Kernel Learning

Rustem Takhanov

arXiv:2601.16139 · cs.LG · January 23, 2026

TL;DR

This paper investigates two notions of intrinsic dimension in kernel learning, analyzes how they relate to each other and to generalization bounds, and proposes algorithms to estimate them from data.

Contribution

The paper analyzes intrinsic dimension in kernel learning, relating the upper Minkowski dimension and the effective dimension to eigenvalue decay and generalization performance.

Findings

01. Worst-case eigenvalue decay is characterized by Kolmogorov n-widths

02. The effective dimension d_K can be smaller than the Minkowski dimension d_ρ

03. The proposed algorithms estimate upper bounds on n-widths from samples

Abstract

The manifold hypothesis suggests that the generalization performance of machine learning methods improves significantly when the intrinsic dimension of the input distribution's support is low. In the context of kernel ridge regression (KRR), we investigate two alternative notions of intrinsic dimension. The first, denoted $d_{ρ}$, is the upper Minkowski dimension defined with respect to the canonical metric induced by a kernel function $K$ on a domain $Ω$. The second, denoted $d_{K}$, is the effective dimension, derived from the decay rate of Kolmogorov $n$-widths associated with $K$ on $Ω$. Given a probability measure $μ$ on $Ω$, we analyze the relationship between these $n$-widths and the eigenvalues of the integral operator $ϕ \mapsto \int_{Ω} K(\cdot, x) ϕ(x) \, dμ(x)$. We show that, for a fixed domain $Ω$, the Kolmogorov $n$-widths characterize the worst-case eigenvalue decay across all…
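The link between eigenvalues and $n$-widths described in the abstract can be illustrated numerically on a finite sample. The sketch below is not the paper's algorithm. It assumes a Gaussian kernel, a hypothetical bandwidth of 1, and synthetic points on a circle embedded in ten dimensions; all function names are illustrative. It computes the eigenvalues of the empirical integral operator (the Gram matrix divided by the sample size) and compares their tail sums with the worst-case squared approximation error of a greedily chosen $n$-dimensional subspace, which upper-bounds the squared $n$-width of the sample set only, not of $Ω$.

```python
import numpy as np


def gaussian_gram(X, bandwidth=1.0):
    """Gram matrix of a Gaussian kernel (an assumed kernel choice)."""
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * bandwidth**2))


def empirical_eigenvalues(G):
    """Eigenvalues of G / m, the integral operator under the empirical measure."""
    m = G.shape[0]
    lam = np.linalg.eigvalsh(G / m)[::-1]  # sort in descending order
    return np.clip(lam, 0.0, None)         # remove tiny negative round-off


def greedy_residuals(G, n_max):
    """Pivoted Cholesky on the Gram matrix.

    Entry n of the result is the largest squared RKHS distance from any
    sample section K(x_i, .) to the span of n greedily chosen sections.
    """
    m = G.shape[0]
    diag = np.diag(G).copy()
    L = np.zeros((m, n_max))
    out = [diag.max()]
    for j in range(n_max):
        p = int(np.argmax(diag))
        if diag[p] <= 1e-12:               # sample already fully captured
            out.extend([0.0] * (n_max - j))
            break
        L[:, j] = (G[:, p] - L[:, :j] @ L[p, :j]) / np.sqrt(diag[p])
        diag = np.clip(diag - L[:, j] ** 2, 0.0, None)
        out.append(diag.max())
    return np.array(out)


# Synthetic data: a circle (intrinsic dimension 1) embedded in R^10.
rng = np.random.default_rng(0)
theta = rng.uniform(0.0, 2.0 * np.pi, size=200)
X = np.zeros((200, 10))
X[:, 0], X[:, 1] = np.cos(theta), np.sin(theta)

n_max = 10
G = gaussian_gram(X)
lam = empirical_eigenvalues(G)
res = greedy_residuals(G, n_max)
tail = np.array([lam[n:].sum() for n in range(n_max + 1)])

for n in range(n_max + 1):
    print(f"n={n:2d}  tail eigenvalue sum={tail[n]:.3e}  greedy sup residual={res[n]:.3e}")
```

For every $n$, the tail sum of empirical eigenvalues cannot exceed the greedy residual: the tail sum is the smallest achievable mean squared projection error over all $n$-dimensional subspaces, while the residual is a worst-case error for one particular subspace. Fast decay of both columns on this data reflects the low intrinsic dimension of the circle rather than the ambient dimension of ten.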

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Face recognition and analysis