Consistency of Cheeger and Ratio Graph Cuts
Nicolas Garcia Trillos, Dejan Slepcev, James von Brecht, Thomas, Laurent, Xavier Bresson

TL;DR
This paper proves that clustering algorithms based on Cheeger and ratio graph cuts are consistent, meaning they accurately recover true data partitions as sample size grows, under specific scaling conditions.
Contribution
It establishes the theoretical consistency of Cheeger and ratio cut-based clustering methods and specifies optimal scaling conditions for their convergence.
Findings
Minimizers of graph cuts converge to continuum cuts as sample size increases
Sharp scaling conditions for connectivity radius are identified
Numerical experiments validate theoretical results and optimality in 2D
Abstract
This paper establishes the consistency of a family of graph-cut-based algorithms for clustering of data clouds. We consider point clouds obtained as samples of a ground-truth measure. We investigate approaches to clustering based on minimizing objective functionals defined on proximity graphs of the given sample. Our focus is on functionals based on graph cuts like the Cheeger and ratio cuts. We show that minimizers of the these cuts converge as the sample size increases to a minimizer of a corresponding continuum cut (which partitions the ground truth measure). Moreover, we obtain sharp conditions on how the connectivity radius can be scaled with respect to the number of sample points for the consistency to hold. We provide results for two-way and for multiway cuts. Furthermore we provide numerical experiments that illustrate the results and explore the optimality of scaling in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Advanced Clustering Algorithms Research · 3D Shape Modeling and Analysis
