Minimax Rates for Estimating the Dimension of a Manifold
Jisu Kim, Alessandro Rinaldo, Larry Wasserman

TL;DR
This paper establishes the statistical limits for estimating the intrinsic dimension of a manifold supporting data, providing bounds on the minimax rates for dimension testing and estimation in high-dimensional settings.
Contribution
It derives the first rigorous upper and lower bounds on the minimax rates for manifold dimension estimation, advancing understanding of the problem's statistical difficulty.
Findings
Upper bound on dimension testing probability: $O(n^{-(d_2/d_1 - 1 - \\epsilon)n})$
Lower bound on testing: $\\Omega(n^{-(2d_2 - 2d_1 + \\epsilon)n})$
Minimax rates for estimating manifold dimension: upper bound $O(n^{-(1/m-1 - \\epsilon)n})$, lower bound $\\Omega(n^{-(2+ \\epsilon)n})$
Abstract
Many algorithms in machine learning and computational geometry require, as input, the intrinsic dimension of the manifold that supports the probability distribution of the data. This parameter is rarely known and therefore has to be estimated. We characterize the statistical difficulty of this problem by deriving upper and lower bounds on the minimax rate for estimating the dimension. First, we consider the problem of testing the hypothesis that the support of the data-generating probability distribution is a well-behaved manifold of intrinsic dimension versus the alternative that it is of dimension , with . With an i.i.d. sample of size , we provide an upper bound on the probability of choosing the wrong dimension of , where is an arbitrarily small positive number. The proof is based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
