Clustering using skewed multivariate heavy tailed distributions with flexible tail behaviour
Darren Wraith, Florence Forbes

TL;DR
This paper introduces a flexible multivariate clustering method based on skewed heavy-tailed distributions, extending Gaussian mixtures with a multiple scaled framework to better model diverse tail and skewness behaviors.
Contribution
It proposes a novel multiple scaled distribution framework allowing different tail and skewness behaviors per dimension, with an EM algorithm for parameter estimation in clustering.
Findings
Enhanced modeling of data with varying tail behaviors
Improved clustering performance on simulated and real data
Greater flexibility in capturing directional data shapes
Abstract
The family of location and scale mixtures of Gaussians has the ability to generate a number of flexible distributional forms. It nests as particular cases several important asymmetric distributions like the Generalised Hyperbolic distribution. The Generalised Hyperbolic distribution in turn nests many other well known distributions such as the Normal Inverse Gaussian (NIG) whose practical relevance has been widely documented in the literature. In a multivariate setting, we propose to extend the standard location and scale mixture concept into a so called multiple scaled framework which has the advantage of allowing different tail and skewness behaviours in each dimension of the variable space with arbitrary correlation between dimensions. Estimation of the parameters is provided via an EM algorithm with a particular focus on NIG distributions. Inference is then extended to cover the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Distribution Estimation and Applications · Financial Risk and Volatility Modeling
