p-value peeking and estimating extrema

Akshay Balsubramani

arXiv:2011.01343·math.ST·November 4, 2020

TL;DR

This paper addresses the downward bias that data peeking introduces into reported p-values in statistical hypothesis testing, by developing methods to estimate the running extrema of test statistics.

Contribution

It introduces principled mechanisms to estimate running extrema of test statistics, directly tackling the bias caused by peeking in various scenarios.

Findings

01. Methods effectively estimate true extrema despite peeking.

02. Approach reduces bias in p-value reporting.

03. Applicable to multiple testing scenarios.

Abstract

A pervasive issue in statistical hypothesis testing is that the reported $p$-values are biased downward by data "peeking": the practice of reporting only progressively extreme values of the test statistic as more data samples are collected. We develop principled mechanisms to estimate such running extrema of test statistics, which directly address the effect of peeking in some general scenarios.
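
The peeking effect described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's estimator; it only shows, under assumed settings, how reporting the smallest $p$-value seen so far inflates the false-positive rate when the null hypothesis is true. The function names, the z-test on standard normal data, and the values of `n_trials`, `n_samples`, and `alpha` are all illustrative assumptions.

```python
import math
import random


def two_sided_p(z):
    """Two-sided p-value for a standard normal z-statistic."""
    return math.erfc(abs(z) / math.sqrt(2))


def simulate(n_trials=2000, n_samples=200, alpha=0.05, seed=0):
    """Compare rejection rates under the null: final p-value vs. running minimum.

    Data are i.i.d. N(0, 1), so the null (mean zero) is true in every trial.
    """
    rng = random.Random(seed)
    final_rejects = 0
    peeking_rejects = 0
    for _ in range(n_trials):
        total = 0.0
        min_p = 1.0
        p = 1.0
        for n in range(1, n_samples + 1):
            total += rng.gauss(0.0, 1.0)
            z = total / math.sqrt(n)   # z-statistic after n samples
            p = two_sided_p(z)
            min_p = min(min_p, p)      # most extreme p-value seen so far
        final_rejects += p < alpha         # honest: test once, at the end
        peeking_rejects += min_p < alpha   # peeking: report the running minimum
    return final_rejects / n_trials, peeking_rejects / n_trials


final_rate, peeking_rate = simulate()
print(f"false-positive rate, final p-value only: {final_rate:.3f}")
print(f"false-positive rate, running minimum:    {peeking_rate:.3f}")
```

The rate based on the final $p$-value should stay near the nominal level `alpha`, while the rate based on the running minimum is typically several times larger, which is the downward bias in reported $p$-values that the paper sets out to address.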

Taxonomy

Topics: Statistical Methods and Inference · Gaussian Processes and Bayesian Inference · Statistical Mechanics and Entropy