Testing a Large Number of Composite Null Hypotheses Using Conditionally Symmetric Multidimensional Gaussian Mixtures in Genome-Wide Studies
Ryan Sun, Zachary McCaw, Xihong Lin

TL;DR
This paper introduces the csmGmm framework for large-scale testing of composite null hypotheses in genetic studies, unifying Bayesian and frequentist methods with improved robustness and interpretability.
Contribution
The work develops the conditionally symmetric multidimensional Gaussian mixture model (csmGmm), unifying lfdr and z-statistic testing rules across multiple genetic analysis settings.
Findings
csmGmm outperforms recent alternatives in robustness
It ensures lfdr and z-statistic testing rule agreement
Applied successfully to lung cancer genetic studies
Abstract
Causal mediation analysis, pleiotropy analysis, and replication analysis are three highly popular genetic study designs. Although these analyses address different scientific questions, the underlying inference problems all involve large-scale testing of composite null hypotheses. The goal is to determine whether all null hypotheses - as opposed to at least one - in a set of individual tests should simultaneously be rejected. Various recent methodology has been proposed for the aforementioned situations, and an appealing empirical Bayes strategy is to apply the popular two-group model, calculating local false discovery rates (lfdr) for each set of hypotheses. However, such a strategy is difficult due to the need for multivariate density estimation. Furthermore, the multiple testing rules for the empirical Bayes lfdr approach and conventional frequentist z-statistics can disagree, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Statistical Methods in Clinical Trials · Genetic Associations and Epidemiology
