Bagged Empirical Null p-values: A Method to Account for Model Uncertainty in Large Scale Inference
Sarah Fletcher Mercaldo, Jeffrey D. Blume

TL;DR
This paper introduces bagged empirical null p-values (BEN p-values) that incorporate model uncertainty through bootstrap aggregation, improving large-scale inference accuracy and reproducibility in high-dimensional studies.
Contribution
The authors propose a novel method combining empirical null p-values with bootstrap aggregation to enhance inference robustness and reproducibility in large-scale testing.
Findings
BEN p-values often significantly change the ranking of significant findings.
Re-analysis of leukemia data revealed new findings supported by prior biological evidence.
Simulation studies demonstrate BEN p-values outperform existing methods in stability and accuracy.
Abstract
When conducting large scale inference, such as genome-wide association studies or image analysis, nominal -values are often adjusted to improve control over the family-wise error rate (FWER). When the majority of tests are null, procedures controlling the False discovery rate (Fdr) can be improved by replacing the theoretical global null with its empirical estimate. However, these other adjustment procedures remain sensitive to the working model assumption. Here we propose two key ideas to improve inference in this space. First, we propose -values that are standardized to the empirical null distribution (instead of the theoretical null). Second, we propose model averaging -values by bootstrap aggregation (Bagging) to account for model uncertainty and selection procedures. The combination of these two key ideas yields bagged empirical null -values (BEN -values) that often…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
