Robust mixture of experts modeling using the skew $t$ distribution
Faicel Chamroukhi

TL;DR
This paper introduces a robust mixture of experts model using the skew t distribution, effectively handling asymmetric, heavy-tailed, and noisy data in regression and clustering tasks.
Contribution
It proposes the STMoE model with an ECM algorithm for parameter estimation, addressing limitations of normal mixture models for complex data distributions.
Findings
Demonstrates robustness and effectiveness on simulated data
Shows improved fit for non-linear regression functions
Successfully applied to real-world musical and climate data
Abstract
Mixture of Experts (MoE) is a popular framework in the fields of statistics and machine learning for modeling heterogeneity in data for regression, classification and clustering. MoE for continuous data are usually based on the normal distribution. However, it is known that for data with asymmetric behavior, heavy tails and atypical observations, the use of the normal distribution is unsuitable. We introduce a new robust non-normal mixture of experts modeling using the skew distribution. The proposed skew mixture of experts, named STMoE, handles these issues of the normal mixtures experts regarding possibly skewed, heavy-tailed and noisy data. We develop a dedicated expectation conditional maximization (ECM) algorithm to estimate the model parameters by monotonically maximizing the observed data log-likelihood. We describe how the presented model can be used in prediction and in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Statistical Methods and Bayesian Inference
