Robust semi-parametric mixtures of linear experts using the contaminated Gaussian distribution
Peterson Mambondimumwe, Sphiwe B. Skhosana, Najmeh Nakhaei Rad

TL;DR
This paper introduces robust semi- and non-parametric mixture of regressions using contaminated Gaussian distributions, enabling effective outlier handling and simultaneous clustering and outlier detection.
Contribution
It proposes a novel contaminated Gaussian mixture of regressions model with algorithms for robust estimation and outlier detection, improving over traditional Gaussian-based models.
Findings
Robust models outperform Gaussian mixtures in presence of outliers.
Algorithms effectively detect outliers and perform clustering.
Simulation and real data demonstrate practical utility.
Abstract
Semi- and non-parametric mixture of regressions are a very useful flexible class of mixture of regressions in which some or all of the parameters are non-parametric functions of the covariates. These models are, however, based on the Gaussian assumption of the component error distributions. Thus, their estimation is sensitive to outliers and heavy-tailed error distributions. In this paper, we propose semi- and non-parametric contaminated Gaussian mixture of regressions to robustly estimate the parametric and/or non-parametric terms of the models in the presence of mild outliers. The virtue of using a contaminated Gaussian error distribution is that we can simultaneously perform model-based clustering of observations and model-based outlier detection. We propose two algorithms, an expectation-maximization (EM)-type algorithm and an expectation-conditional-maximization (ECM)-type…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Bayesian Methods and Mixture Models · Anomaly Detection Techniques and Applications
