Set-based differential covariance testing for high-throughput data
Yi-Hui Zhou

TL;DR
This paper introduces a flexible, high-dimensional framework for testing the association between sets of features' covariance matrices and experimental variables, applicable to both discrete and continuous data.
Contribution
It proposes a novel, general framework with four summary statistics, including a new connectivity statistic, for covariance testing in high-dimensional settings, extending beyond two-sample problems.
Findings
The approach is applicable when p >> n.
Asymptotic p-values are derived for several statistics.
The methods show improved power in simulations and real data analysis.
Abstract
The problem of detecting changes in covariance for a single pair of features has been studied in some detail, but may be limited in importance or general applicability. In contrast, testing equality of covariance matrices of a {\it set} of features may offer increased power and interpretability. Such approaches have received increasing attention in recent years, especially in the context of high-dimensional testing. These approaches have been limited to the two-sample problem and involve varying assumptions on the number of features vs. the sample size . In addition, there has been little discussion of the motivating principles underlying various choices of statistic, and no general approaches to test association of covariances with a continuous outcome. We propose a uniform framework to test association of covariance matrices with an experimental variable, whether discrete or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Statistical Methods in Clinical Trials
