Differential gene expression analysis via two-component mixture models with a semiparametric skew-normal scale mixture alternative
Sangkon Oh, Geoffrey J. McLachlan

TL;DR
This paper introduces a flexible semiparametric mixture model for differential gene expression analysis that effectively handles skewness and heavy tails, improving accuracy over traditional methods.
Contribution
It proposes a novel semiparametric mixture model with an unspecified scale distribution, enhancing robustness and flexibility in gene expression studies.
Findings
Improved detection accuracy over existing methods.
Reduced false discovery and false negative rates.
Model is identifiable and computationally efficient.
Abstract
Two-component mixture models are particularly useful for identifying differentially expressed genes, but their performance can deteriorate markedly when the alternative distribution departs from parametric assumptions or symmetry. We propose a semiparametric mixture model in which the null component is standard normal and the alternative follows a skew-normal scale mixture with an unspecified scale mixing distribution. This formulation accommodates skewness and heavy tails, providing a flexible and computationally tractable tool for differential gene-expression analysis without restrictive distributional assumptions. We establish identifiability and consistency of the model and develop an efficient estimation algorithm that incorporates nonparametric maximum likelihood estimation of the scale distribution. Numerical studies show notable improvements over existing parametric and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Gene expression and cancer classification
