Combining individually valid and conditionally i.i.d. P-variables
Lutz Mattner

TL;DR
This paper develops a method to combine valid P-variables, especially order statistics, into a single valid P-variable, with applications demonstrated in biological association studies.
Contribution
It introduces the function $f_{n,k}$ for optimally combining order statistic P-variables into a valid summary, extending the theory of P-value combination under specific independence conditions.
Findings
The function $f_{n,k}$ is the smallest increasing function ensuring validity.
An approximation for $f_{n,k}$ is given by $1 \, \wedge \, \frac{n}{k} u$ for large $k$.
Application to primate association data demonstrates practical utility.
Abstract
For a given testing problem, let be individually valid and conditionally on the data i.i.d.\ P-variables (often called P-values). For example, the data could come in groups, and each could be based on subsampling just one datum from each group in order to satisfy an independence assumption under the hypothesis. The problem is then to deterministically combine the into a valid summary P-variable. Restricting here our attention to functions of a given order statistic of the , we compute the function which is smallest among all increasing functions such that is always a valid P-variable under the stated assumptions. Since , with the right hand side being a good approximation for the left when is large, one may in particular always take the minimum of 1 and twice the left sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Machine Learning and Algorithms · Digital Image Processing Techniques
