Assessment of P-value variability in the current replicability crisis
Olga A. Vsevolozhskaya, Gabriel Ruiz, Dmitri V. Zaykin

TL;DR
This paper critically examines the variability of P-values in scientific research, highlighting their unreliability for replication and proposing a new method to better estimate their distribution considering effect size biases.
Contribution
It introduces a novel approach to construct probabilistic bounds for P-values, addressing limitations of existing P-intervals and allowing researchers to control assumptions based on prior effect size knowledge.
Findings
P-values exhibit high variability, questioning their use in replication.
Existing P-intervals are based on unrealistic assumptions leading to bias.
The proposed method provides more reliable bounds for P-values, resistant to selection bias.
Abstract
Increased availability of data and accessibility of computational tools in recent years have created unprecedented opportunities for scientific research driven by statistical analysis. Inherent limitations of statistics impose constrains on reliability of conclusions drawn from data but misuse of statistical methods is a growing concern. Significance, hypothesis testing and the accompanying P-values are being scrutinized as representing most widely applied and abused practices. One line of critique is that P-values are inherently unfit to fulfill their ostensible role as measures of scientific hypothesis's credibility. It has also been suggested that while P-values may have their role as summary measures of effect, researchers underappreciate the degree of randomness in the P-value. High variability of P-values would suggest that having obtained a small P-value in one study, one is,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMeta-analysis and systematic reviews · Statistical Methods in Clinical Trials · Explainable Artificial Intelligence (XAI)
