A regularized MANOVA test for semicontinuous high-dimensional data
Elena Sabbioni, Claudio Agostinelli, Alessio Farcomeni

TL;DR
This paper introduces a regularized MANOVA test tailored for high-dimensional semicontinuous data, enabling hypothesis testing when traditional methods are infeasible due to dimensionality constraints.
Contribution
It develops a novel likelihood ratio-based test with closed-form regularized estimators suitable for high-dimensional semicontinuous data, validated through simulations and real data applications.
Findings
Test maintains appropriate level in simulations
Demonstrates effectiveness on microRNA data
Successfully applied to ecological invasion data
Abstract
We propose a MANOVA test for semicontinuous data that is applicable also when the dimensionality exceeds the sample size. The test statistic is obtained as a likelihood ratio, where numerator and denominator are computed at the maxima of penalized likelihood functions under each hypothesis. Closed form solutions for the regularized estimators allow us to avoid computational overheads. We derive the null distribution using a permutation scheme. The power and level of the resulting test are evaluated in a simulation study. We illustrate the new methodology with two original data analyses, one regarding microRNA expression in human blastocyst cultures, and another regarding alien plant species invasion in the island of Socotra (Yemen).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPharmacological Effects of Medicinal Plants
