ContaminatedMixt: An R Package for Fitting Parsimonious Mixtures of Multivariate Contaminated Normal Distributions
Antonio Punzo, Angelo Mazza, Paul D. McNicholas

TL;DR
ContaminatedMixt is an R package that enables robust clustering and classification using mixtures of multivariate contaminated normal distributions, with model variants, outlier detection, and parallel computation features.
Contribution
The paper introduces ContaminatedMixt, an R package implementing parsimonious mixtures of contaminated normal distributions for robust data analysis.
Findings
Effective outlier detection via maximum a posteriori probabilities.
Supports multiple model variants for parsimony.
Allows parallel computation for efficiency.
Abstract
We introduce the R package ContaminatedMixt, conceived to disseminate the use of mixtures of multivariate contaminated normal distributions as a tool for robust clustering and classification under the common assumption of elliptically contoured groups. Thirteen variants of the model are also implemented to introduce parsimony. The expectation-conditional maximization algorithm is adopted to obtain maximum likelihood parameter estimates, and likelihood-based model selection criteria are used to select the model and the number of groups. Parallel computation can be used on multicore PCs and computer clusters, when several models have to be fitted. Differently from the more popular mixtures of multivariate normal and t distributions, this approach also allows for automatic detection of mild outliers via the maximum a posteriori probabilities procedure. To exemplify the use of the package,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Advanced Statistical Methods and Models
