Mixture Model Averaging for Clustering
Yuhong Wei, Paul D. McNicholas

TL;DR
This paper proposes a novel approach to improve clustering by averaging multiple mixture models close to the best fit, rather than selecting a single optimal model, using Bayesian averaging and component merging techniques.
Contribution
It introduces mixture model averaging methods for clustering, including component merging based on the adjusted Rand index, enhancing robustness over traditional model selection.
Findings
Averaging multiple models improves clustering stability.
The proposed methods outperform single-model selection in experiments.
Component merging based on Rand index effectively combines similar components.
Abstract
In mixture model-based clustering applications, it is common to fit several models from a family and report clustering results from only the `best' one. In such circumstances, selection of this best model is achieved using a model selection criterion, most often the Bayesian information criterion. Rather than throw away all but the best model, we average multiple models that are in some sense close to the best one, thereby producing a weighted average of clustering results. Two (weighted) averaging approaches are considered: averaging the component membership probabilities and averaging models. In both cases, Occam's window is used to determine closeness to the best model and weights are computed within a Bayesian model averaging paradigm. In some cases, we need to merge components before averaging; we introduce a method for merging mixture components based on the adjusted Rand index.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
