Copula-based mixture model identification for subgroup clustering with imaging applications
Fei Zheng, Nicolas Duchateau

TL;DR
This paper introduces a flexible copula-based mixture model for clustering, capable of handling heterogeneous component distributions, and proposes an iterative algorithm for unsupervised identification, demonstrated on synthetic, image, and medical data.
Contribution
It develops a novel GICE algorithm adapted for CBMMs, enabling unsupervised identification of complex mixture models with heterogeneous components.
Findings
Effective on synthetic two-cluster data.
Outperforms EM in MNIST clustering.
Useful for medical imaging analysis.
Abstract
Model-based clustering techniques have been widely applied to various application areas, while most studies focus on canonical mixtures with unique component distribution form. However, this strict assumption is often hard to satisfy. In this paper, we consider the more flexible Copula-Based Mixture Models (CBMMs) for clustering, which allow heterogeneous component distributions composed by flexible choices of marginal and copula forms. More specifically, we propose an adaptation of the Generalized Iterative Conditional Estimation (GICE) algorithm to identify the CBMMs in an unsupervised manner, where the marginal and copula forms and their parameters are estimated iteratively. GICE is adapted from its original version developed for switching Markov model identification with the choice of realization time. Our CBMM-GICE clustering method is then tested on synthetic two-cluster data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models
MethodsFocus
