Flexible control of the median of the false discovery proportion

Jesse Hemerik; Aldo Solari; Jelle J Goeman

arXiv:2208.11570·stat.ME·March 14, 2024

Flexible control of the median of the false discovery proportion

Jesse Hemerik, Aldo Solari, Jelle J Goeman

PDF

Open Access

TL;DR

This paper presents a new multiple testing procedure that controls the median of the false discovery proportion, offering greater flexibility and validity for finite samples without assuming independence, compared to traditional methods.

Contribution

The authors introduce a median FDP control method that is flexible post hoc and does not assume independence, extending the capabilities of existing procedures like Benjamini-Hochberg.

Findings

01

Controls median FDP effectively in simulations

02

Allows flexible alpha selection after data viewing

03

Operates with linear time complexity

Abstract

We introduce a multiple testing procedure that controls the median of the proportion of false discoveries (FDP) in a flexible way. The procedure only requires a vector of p-values as input and is comparable to the Benjamini-Hochberg method, which controls the mean of the FDP. Our method allows freely choosing one or several values of alpha after seeing the data -- unlike Benjamini-Hochberg, which can be very liberal when alpha is chosen post hoc. We prove these claims and illustrate them with simulations. Our procedure is inspired by a popular estimator of the total number of true hypotheses. We adapt this estimator to provide simultaneously median unbiased estimators of the FDP, valid for finite samples. This simultaneity allows for the claimed flexibility. Our approach does not assume independence. The time complexity of our method is linear in the number of hypotheses, after sorting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Bayesian Modeling and Causal Inference