Combining p-values via averaging
Vladimir Vovk, Ruodu Wang

TL;DR
This paper introduces new methods for combining multiple p-values in hypothesis testing without dependence assumptions, extending classical results to generalized means and discussing their efficiency and application.
Contribution
It extends classical p-value combination methods to generalized means using recent mathematical finance techniques, including weighted averages and efficiency considerations.
Findings
Harmonic mean combination scaled by ln K is effective for large K.
Generalized means provide flexible p-value combination strategies.
The methods improve robustness in multiple testing scenarios.
Abstract
This paper proposes general methods for the problem of multiple testing of a single hypothesis, with a standard goal of combining a number of p-values without making any assumptions about their dependence structure. An old result by R\"uschendorf and, independently, Meng implies that the p-values can be combined by scaling up their arithmetic mean by a factor of 2 (and no smaller factor is sufficient in general). A similar result about the geometric mean (Mattner) replaces 2 by . Based on more recent developments in mathematical finance, specifically, robust risk aggregation techniques, we extend these results to generalized means; in particular, we show that p-values can be combined by scaling up their harmonic mean by a factor of (asymptotically as ). This leads to a generalized version of the Bonferroni-Holm procedure. We also explore methods using weighted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Advanced Statistical Methods and Models · Optimal Experimental Design Methods
