A Robust and Flexible EM Algorithm for Mixtures of Elliptical Distributions with Missing Data
Florian Mouret, Alexandre Hippert-Ferrer, Fr\'ed\'eric Pascal,, Jean-Yves Tourneret

TL;DR
This paper introduces a robust EM algorithm for imputing missing data in noisy, non-Gaussian datasets using mixtures of elliptical distributions, improving robustness to outliers and heterogeneity.
Contribution
It develops a new EM algorithm tailored for mixtures of elliptical distributions, extending classical Gaussian-based methods to handle non-Gaussian, contaminated data effectively.
Findings
The proposed algorithm is robust to outliers.
It performs well on synthetic non-Gaussian data.
It is competitive with classical imputation methods on real datasets.
Abstract
This paper tackles the problem of missing data imputation for noisy and non-Gaussian data. A classical imputation method, the Expectation Maximization (EM) algorithm for Gaussian mixture models, has shown interesting properties when compared to other popular approaches such as those based on k-nearest neighbors or on multiple imputations by chained equations. However, Gaussian mixture models are known to be non-robust to heterogeneous data, which can lead to poor estimation performance when the data is contaminated by outliers or follows non-Gaussian distributions. To overcome this issue, a new EM algorithm is investigated for mixtures of elliptical distributions with the property of handling potential missing data. This paper shows that this problem reduces to the estimation of a mixture of Angular Gaussian distributions under generic assumptions (i.e., each sample is drawn from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Census and Population Estimation
