The Burbea-Rao and Bhattacharyya centroids
Frank Nielsen, Sylvain Boltz

TL;DR
This paper introduces a new framework for computing centroids based on Burbea-Rao divergences, generalizing Jensen-Shannon divergence, and applies it to efficiently find Bhattacharyya centroids for exponential family distributions, enhancing clustering methods.
Contribution
It establishes the properties of Burbea-Rao centroids, links Bhattacharyya distances to these divergences, and develops an efficient algorithm for centroid computation in statistical exponential families.
Findings
Burbea-Rao centroids are unique and approximable via iterative algorithms.
Bhattacharyya distances for exponential families are equivalent to Burbea-Rao divergences.
The proposed centroid algorithm improves clustering of Gaussian mixtures.
Abstract
We study the centroid with respect to the class of information-theoretic Burbea-Rao divergences that generalize the celebrated Jensen-Shannon divergence by measuring the non-negative Jensen difference induced by a strictly convex and differentiable function. Although those Burbea-Rao divergences are symmetric by construction, they are not metric since they fail to satisfy the triangle inequality. We first explain how a particular symmetrization of Bregman divergences called Jensen-Bregman distances yields exactly those Burbea-Rao divergences. We then proceed by defining skew Burbea-Rao divergences, and show that skew Burbea-Rao divergences amount in limit cases to compute Bregman divergences. We then prove that Burbea-Rao centroids are unique, and can be arbitrarily finely approximated by a generic iterative concave-convex optimization algorithm with guaranteed convergence property. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
