Bregman Power k-Means for Clustering Exponential Family Data
Adithya Vellal, Saptarshi Chakraborty, Jason Xu

TL;DR
This paper introduces Bregman Power k-Means, a clustering algorithm that leverages Bregman divergences for exponential family data, providing closed-form updates, theoretical bounds, and improved performance on non-Gaussian datasets.
Contribution
It develops a new clustering method combining Bregman divergences with power means, offering theoretical analysis and superior empirical results for diverse data types.
Findings
Outperforms existing methods on non-Gaussian data
Provides finite sample bounds without bounded support assumption
Offers a simple, closed-form update algorithm
Abstract
Recent progress in center-based clustering algorithms combats poor local minima by implicit annealing, using a family of generalized means. These methods are variations of Lloyd's celebrated -means algorithm, and are most appropriate for spherical clusters such as those arising from Gaussian data. In this paper, we bridge these algorithmic advances to classical work on hard clustering under Bregman divergences, which enjoy a bijection to exponential family distributions and are thus well-suited for clustering objects arising from a breadth of data generating mechanisms. The elegant properties of Bregman divergences allow us to maintain closed form updates in a simple and transparent algorithm, and moreover lead to new theoretical arguments for establishing finite sample bounds that relax the bounded support assumption made in the existing state of the art. Additionally, we consider…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Statistical Methods and Models · Statistical Methods and Inference
