Steady Continuous Monitoring is (Just Barely) Impossible for Tests of Unbounded Length
Eric Bax, Alex Shtoff

TL;DR
This paper investigates the challenges of continuous monitoring in AB testing, demonstrating that maintaining a constant significance requirement over unbounded test lengths is impossible, but near-constant control can be achieved with repeated significance confirmation.
Contribution
It introduces a framework showing the fundamental limitations of continuous monitoring in unbounded AB tests and proposes methods to approximate constant significance requirements.
Findings
Constant significance requirements are impossible for unbounded tests.
Repeated significance confirmation can approximate constant significance levels.
The proposed methods approach the ideal of steady continuous monitoring.
Abstract
AB testing evaluates the difference between a control and a treatment in a statistically rigorous manner. Continuous monitoring allows statistical evaluation of an AB test as it proceeds. One goal of continuous monitoring is early stopping -- confirming a statistically significant difference between control and treatment as soon as possible. Another goal is to maintain some statistical capability to discover significant differences later in the test if they cannot be confirmed earlier. These goals are in conflict -- looser requirements for early stopping leave us with more stringent ones for later. This paper shows that it is impossible to maintain a constant requirement for significance for tests that have no a priori stopping time, but we can come arbitrarily close to that goal by using tests that require repeated significant results to con rm statistically significant differences…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems
