Extreme-Path Benchmarks for Sequential Probability Forecasts
Jonathan Pipping-Gam\'on, Abraham J. Wyner

TL;DR
This paper introduces benchmark distributions for extreme-path functionals in sequential probability forecasts, enabling better diagnosis of miscalibration in real-time binary outcome predictions.
Contribution
It derives exact and finite-sample benchmark distributions for extreme forecast paths, providing model-agnostic null targets for calibration diagnostics.
Findings
Exact benchmark in continuous time for peak-on-loss
Finite-sample bounds with correction for discrete time
Application to ESPN sports data shows broad agreement and departures
Abstract
Real-time probability forecasts for binary outcomes are routine in sports, online experimentation, medicine, and finance. Retrospective narratives, however, often hinge on pathwise extremes: for example, a forecast that becomes "90% certain" for an event that ultimately does not occur. Standard pointwise calibration tools do not quantify how frequently such extremes should arise under correct sequential calibration, where the ideal forecast sequence is a bounded martingale that ends at the realized outcome. We derive benchmark distributions for extreme-path functionals conditional on the terminal outcome, emphasizing the peak-on-loss: the largest forecast value attained along realizations that end in failure. In continuous time with continuous paths we obtain an exact closed-form benchmark; in discrete time we prove sharp finite-sample bounds together with an explicit correction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
