Early Stopping Based on Repeated Significance

Eric Bax; Arundhyoti Sarkar; and Alex Shtoff

arXiv:2408.00908·stat.ME·August 5, 2024

Early Stopping Based on Repeated Significance

Eric Bax, Arundhyoti Sarkar, and Alex Shtoff

PDF

Open Access

TL;DR

This paper proposes a method for early stopping in statistical tests by requiring repeated significance at multiple decision points, balancing confidence levels with practical testing constraints.

Contribution

It introduces a novel approach to early stopping that uses repeated significance criteria to maintain statistical confidence without overly strict p-value requirements.

Findings

01

Requiring success at multiple decision points improves early stopping reliability.

02

The method balances confidence levels with practical testing constraints.

03

It extends traditional significance testing to sequential decision-making.

Abstract

For a bucket test with a single criterion for success and a fixed number of samples or testing period, requiring a $p$ -value less than a specified value of $α$ for the success criterion produces statistical confidence at level $1 - α$ . For multiple criteria, a Bonferroni correction that partitions $α$ among the criteria produces statistical confidence, at the cost of requiring lower $p$ -values for each criterion. The same concept can be applied to decisions about early stopping, but that can lead to strict requirements for $p$ -values. We show how to address that challenge by requiring criteria to be successful at multiple decision points.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsControl Systems and Identification · Advanced Statistical Process Monitoring · Fault Detection and Control Systems