An Affine-Invariant Bayesian Cluster Process
Hsin-Hsiung Huang, Jie Yang

TL;DR
This paper introduces a Bayesian clustering method invariant to various affine transformations, capable of identifying the number and structure of clusters without prior knowledge, applicable to diverse real-world datasets.
Contribution
It develops a novel affine-invariant Bayesian cluster process with a split-merge algorithm, handling unknown cluster counts and various linear transformations.
Findings
Effective on synthetic and real datasets
Identifies clusters invariant to transformations
Applicable in genomics and geographic data
Abstract
In order to identify clusters of objects with features transformed by unknown affine transformations, we develop a Bayesian cluster process which is invariant with respect to certain linear transformations of the feature space and able to cluster data without knowing the number of clusters in advance. Specifically, our proposed method can identify clusters invariant to orthogonal transformations under model I, invariant to scaling-coordinate orthogonal transformations under model II, or invariant to arbitrary non-singular linear transformations under model III. The proposed split-merge algorithm leads to an irreducible and aperiodic Markov chain, which is also efficient at identifying clusters reasonably well for various applications. We illustrate the applications of our approach to both synthetic and real data such as leukemia gene expression data for model I; wine data and two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Stochastic processes and statistical mechanics
