Flexible Modeling of Multivariate Skewed and Heavy-Tailed Data via a Non-Central Skew t Distribution: Application to Tumor Shape Data
Abeer M. Hasan, Ying-Ju Chen

TL;DR
This paper introduces a flexible multivariate skew t distribution that captures asymmetry and heavy tails, demonstrated on tumor shape data, outperforming traditional models in fit and interpretability.
Contribution
The paper develops a non-central skew t distribution with theoretical properties, estimation methods, and practical application to tumor shape data, expanding modeling options for complex data.
Findings
NCST model outperforms standard models in fit and diagnostics.
Monte Carlo likelihood enables effective parameter estimation.
Model captures skewness and heavy tails in tumor shape data.
Abstract
We propose a flexible formulation of the multivariate non-central skew t (NCST) distribution, defined by scaling skew-normal random vectors with independent chi-squared variables. This construction extends the classical multivariate t family by allowing both asymmetry and non-centrality, which provides an alternative to existing skew t models that often rely on restrictive assumptions for tractability. We derive key theoretical properties of the NCST distribution, which includes its moment structure, affine transformation behavior, and the distribution of quadratic forms. Due to the lack of a closed-form density, we implement a Monte Carlo likelihood approximation to enable maximum likelihood estimation and evaluate its performance through simulation studies. To demonstrate practical utility, we apply the NCST model to breast cancer diagnostic data, modeling multiple features of tumor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Gene expression and cancer classification · Radiomics and Machine Learning in Medical Imaging
