CliPS -- How to identify cluster distributions in Bayesian mixture models
Gertraud Malsiner-Walli, Sylvia Fr\"uhwirth-Schnatter, Bettina Gr\"un

TL;DR
The paper introduces CliPS, a novel method for identifying and validating cluster structures in Bayesian mixture models by leveraging point process representations and functional separation of component posteriors.
Contribution
It presents a new procedure, CliPS, that assesses cluster distinguishability and structure directly from MCMC samples in Bayesian mixture models.
Findings
CliPS effectively identifies cluster distributions in simulated and real data.
The method improves cluster validation by exploiting posterior separation of functionals.
CliPS provides a practical tool for model-based clustering validation.
Abstract
We propose the CliPS procedure when fitting Bayesian mixture models in the context of model-based clustering to identify the cluster distributions while simultaneously assessing the suitability of a cluster solution and validating the cluster structure. The procedure relies on the point process representation of a mixture model and is based on the assumption that a suitable cluster solution requires the clusters to be distinguishable with respect to a low-dimensional functional of the component-specific parameters of the mixture. CliPS maps the component-specific MCMC draws to the point process representation and identifies clusters there, exploiting that, while data distributions usually overlap, the posterior of these functionals are more and more separated for increasing sample size. We outline the procedure and illustrate its use on several model-based clustering examples.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Advanced Clustering Algorithms Research
