Infinite mixtures of multivariate normal-inverse Gaussian distributions for clustering of skewed data
Yuan Fang, Dimitris Karlis, Sanjeena Subedi

TL;DR
This paper introduces an infinite mixture model using Dirichlet process for clustering skewed data with heavy tails, eliminating the need to pre-specify the number of clusters and improving flexibility.
Contribution
It develops a Bayesian nonparametric framework for mixtures of MNIG distributions, allowing the number of clusters to be inferred automatically.
Findings
Provides competitive clustering results on benchmark datasets
Demonstrates effective parameter recovery in simulations
Reduces uncertainty in the number of clusters
Abstract
Mixtures of multivariate normal inverse Gaussian (MNIG) distributions can be used to cluster data that exhibit features such as skewness and heavy tails. However, for cluster analysis, using a traditional finite mixture model framework, either the number of components needs to be known - or needs to be estimated - using some model selection criterion after deriving results for a range of possible number of components. However, different model selection criteria can sometimes result in different number of components yielding uncertainty. Here, an infinite mixture model framework, also known as Dirichlet process mixture model, is proposed for the mixtures of MNIG distributions. This Dirichlet process mixture model approach allows the number of components to grow or decay freely from 1 to (in practice from 1 to ) and the number of components is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Statistical Distribution Estimation and Applications
