Robust mixture of experts modeling using the skew $t$ distribution

Faicel Chamroukhi

arXiv:1612.06879·stat.ME·December 22, 2016

Robust mixture of experts modeling using the skew $t$ distribution

Faicel Chamroukhi

PDF

Open Access

TL;DR

This paper introduces a robust mixture of experts model using the skew t distribution, effectively handling asymmetric, heavy-tailed, and noisy data in regression and clustering tasks.

Contribution

It proposes the STMoE model with an ECM algorithm for parameter estimation, addressing limitations of normal mixture models for complex data distributions.

Findings

01

Demonstrates robustness and effectiveness on simulated data

02

Shows improved fit for non-linear regression functions

03

Successfully applied to real-world musical and climate data

Abstract

Mixture of Experts (MoE) is a popular framework in the fields of statistics and machine learning for modeling heterogeneity in data for regression, classification and clustering. MoE for continuous data are usually based on the normal distribution. However, it is known that for data with asymmetric behavior, heavy tails and atypical observations, the use of the normal distribution is unsuitable. We introduce a new robust non-normal mixture of experts modeling using the skew $t$ distribution. The proposed skew $t$ mixture of experts, named STMoE, handles these issues of the normal mixtures experts regarding possibly skewed, heavy-tailed and noisy data. We develop a dedicated expectation conditional maximization (ECM) algorithm to estimate the model parameters by monotonically maximizing the observed data log-likelihood. We describe how the presented model can be used in prediction and in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Statistical Methods and Bayesian Inference