On the Intrinsic Dimensions of Data in Kernel Learning
Rustem Takhanov

TL;DR
This paper investigates two notions of intrinsic dimension in kernel learning, analyzing their relationship, impact on generalization bounds, and proposing algorithms to estimate these dimensions from data.
Contribution
It introduces a novel analysis of intrinsic dimensions in kernel learning, relating Minkowski and effective dimensions to eigenvalue decay and generalization performance.
Findings
Eigenvalues decay characterized by Kolmogorov n-widths
Effective dimension d_K can be smaller than Minkowski dimension d_ρ
Proposed algorithms estimate upper bounds on n-widths from samples
Abstract
The manifold hypothesis suggests that the generalization performance of machine learning methods improves significantly when the intrinsic dimension of the input distribution's support is low. In the context of KRR, we investigate two alternative notions of intrinsic dimension. The first, denoted , is the upper Minkowski dimension defined with respect to the canonical metric induced by a kernel function on a domain . The second, denoted , is the effective dimension, derived from the decay rate of Kolmogorov -widths associated with on . Given a probability measure on , we analyze the relationship between these -widths and eigenvalues of the integral operator . We show that, for a fixed domain , the Kolmogorov -widths characterize the worst-case eigenvalue decay across all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Face recognition and analysis
