Clustering in Hilbert simplex geometry
Frank Nielsen, Ke Sun

TL;DR
This paper introduces Hilbert simplex geometry as a new, computationally efficient way to cluster categorical distributions within the probability simplex, comparing it with existing geometric methods and demonstrating its effectiveness.
Contribution
The work presents a novel Hilbert simplex geometry framework for clustering, providing a new distance measure and benchmarking its performance against traditional information-geometric approaches.
Findings
Hilbert simplex geometry is computationally friendly for clustering.
Benchmark results show competitive performance of Hilbert geometry.
Comparison with Fisher and other divergences highlights advantages and limitations.
Abstract
Clustering categorical distributions in the finite-dimensional probability simplex is a fundamental task met in many applications dealing with normalized histograms. Traditionally, the differential-geometric structures of the probability simplex have been used either by (i) setting the Riemannian metric tensor to the Fisher information matrix of the categorical distributions, or (ii) defining the dualistic information-geometric structure induced by a smooth dissimilarity measure, the Kullback-Leibler divergence. In this work, we introduce for clustering tasks a novel computationally-friendly framework for modeling geometrically the probability simplex: The {\em Hilbert simplex geometry}. In the Hilbert simplex geometry, the distance is the non-separable Hilbert's metric distance which satisfies the property of information monotonicity with distance level set functions described by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Bayesian Methods and Mixture Models · Advanced Statistical Methods and Models
