Compound p-Value Statistics for Multiple Testing Procedures
Joshua D. Habiger, Edsel A. Pena

TL;DR
This paper introduces compound p-value statistics that utilize all available data, improving the power of multiple testing procedures while maintaining control over false positives, demonstrated through theoretical analysis, simulations, and real data application.
Contribution
It develops a new class of compound p-value statistics that depend on all data and satisfy independence and uniformity under nulls, enhancing testing power.
Findings
Compound p-values increase detection of false nulls.
Procedures maintain control of type I error rate.
Application to microarray data shows improved power.
Abstract
Many multiple testing procedures make use of the p-values from the individual pairs of hypothesis tests, and are valid if the p-value statistics are independent and uniformly distributed under the null hypotheses. However, it has recently been shown that these types of multiple testing procedures are inefficient since such p-values do not depend upon all of the available data. This paper provides tools for constructing compound p-value statistics, which are those that depend upon all of the available data, but still satisfy the conditions of independence and uniformity under the null hypotheses. As an example, a class of compound p-value statistics for testing for location shifts is developed. It is demonstrated, both analytically and through simulations, that multiple testing procedures tend to reject more false null hypotheses when applied to these compound p-values rather than the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Gene expression and cancer classification · Optimal Experimental Design Methods
