Fast $k$-means clustering in Riemannian manifolds via Fr\'{e}chet maps: Applications to large-dimensional SPD matrices
Ji Shi, Nicolas Charon, Andreas Mang, Demetrio Labate, Robert Azencott

TL;DR
This paper presents a fast and efficient k-means clustering framework for high-dimensional non-Euclidean manifolds, especially SPD matrices, using Fréchet maps to embed manifold data into Euclidean space for scalable clustering.
Contribution
The authors introduce the $p$-Fréchet map for embedding manifold data into Euclidean space, enabling fast k-means clustering on high-dimensional non-Euclidean data, with rigorous analysis and extensive experiments.
Findings
Reduces runtime by up to two orders of magnitude compared to intrinsic methods.
Maintains high clustering accuracy on synthetic and real SPD data.
Effective even where existing methods struggle or fail.
Abstract
We introduce a novel, efficient framework for clustering data on high-dimensional, non-Euclidean manifolds that overcomes the computational challenges associated with standard intrinsic methods. The key innovation is the use of the -Fr\'{e}chet map -- defined on a generic metric space -- which embeds the manifold data into a lower-dimensional Euclidean space using a set of reference points , . Once embedded, we can efficiently and accurately apply standard Euclidean clustering techniques such as k-means. We rigorously analyze the mathematical properties of in the Euclidean space and the challenging manifold of symmetric positive definite matrices . Extensive numerical experiments using synthetic and real data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMorphological variations and asymmetry · Advanced Clustering Algorithms Research · Topological and Geometric Data Analysis
