Multiple testing with anytime-valid Monte Carlo p-values
Lasse Fischer, Timothy Barry, Aaditya Ramdas

TL;DR
This paper introduces a method that integrates anytime-valid Monte Carlo p-values into multiple testing procedures, significantly reducing computational effort while maintaining false discovery rate control in high-dimensional data analysis.
Contribution
It develops a data-adaptive permutation testing framework that controls FDR and reduces computational time in large-scale hypothesis testing.
Findings
Reduces permutation testing time from days to minutes.
Maintains FDR control despite data-dependent p-value stopping.
Increases number of rejections in genomics data analysis.
Abstract
In contemporary problems involving genetic or neuroimaging data, thousands of hypotheses need to be tested. Due to their high power, and finite sample guarantees on type-I error under weak assumptions, Monte Carlo permutation tests are often considered as gold standard for these settings. However, the enormous computational effort required for (thousands of) permutation tests is a major burden. In this paper, we integrate recently constructed anytime-valid permutation p-values into a broad class of multiple testing procedures, including the Benjamini-Hochberg procedure. This allows to fully adapt the number of permutations to the underlying data and thus, for example, to the number of rejections made by the multiple testing procedure. Even though this data-adaptive stopping can induce dependencies between the p-values that violate the usual assumptions of the Benjamini-Hochberg…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials
