Cluster Trees on Manifolds
Sivaraman Balakrishnan, Srivatsan Narayanan, Alessandro Rinaldo, Aarti, Singh, Larry Wasserman

TL;DR
This paper studies the estimation of cluster trees for densities on manifolds embedded in high-dimensional space, showing convergence rates depend only on the manifold dimension, not the ambient space.
Contribution
It introduces a modified $k$-nearest neighbor algorithm for manifold-supported densities and proves dimension-dependent convergence rates, also extending results to kernel density estimators.
Findings
Convergence rates depend only on the manifold dimension $d$, not the ambient dimension $D$.
Modified $k$-nearest neighbor algorithm achieves optimal rates under mild assumptions.
Spatially adaptive algorithms improve rates in the known manifold case.
Abstract
In this paper we investigate the problem of estimating the cluster tree for a density supported on or near a smooth -dimensional manifold isometrically embedded in . We analyze a modified version of a -nearest neighbor based algorithm recently proposed by Chaudhuri and Dasgupta. The main results of this paper show that under mild assumptions on and , we obtain rates of convergence that depend on only but not on the ambient dimension . We also show that similar (albeit non-algorithmic) results can be obtained for kernel density estimators. We sketch a construction of a sample complexity lower bound instance for a natural class of manifold oblivious clustering algorithms. We further briefly consider the known manifold case and show that in this case a spatially adaptive algorithm achieves better rates.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
