Accumulation tests for FDR control in ordered hypothesis testing
Ang Li, Rina Foygel Barber

TL;DR
This paper introduces accumulation tests, including a new HingeExp method, for ordered hypothesis testing to improve detection power while controlling FDR, applicable to large-scale scientific data analysis.
Contribution
The paper develops a family of accumulation tests with a novel HingeExp method that adaptively controls FDR and enhances power in ordered multiple testing scenarios.
Findings
HingeExp outperforms existing methods in simulations.
The methods control a modified FDR in finite samples.
Application to real data demonstrates improved gene expression detection.
Abstract
Multiple testing problems arising in modern scientific applications can involve simultaneously testing thousands or even millions of hypotheses, with relatively few true signals. In this paper, we consider the multiple testing problem where prior information is available (for instance, from an earlier study under different experimental conditions), that can allow us to test the hypotheses as a ranked list in order to increase the number of discoveries. Given an ordered list of n hypotheses, the aim is to select a data-dependent cutoff k and declare the first k hypotheses to be statistically significant while bounding the false discovery rate (FDR). Generalizing several existing methods, we develop a family of "accumulation tests" to choose a cutoff k that adapts to the amount of signal at the top of the ranked list. We introduce a new method in this family, the HingeExp method, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Gene expression and cancer classification · Statistical Methods and Inference
