Extreme-Path Benchmarks for Sequential Probability Forecasts

Jonathan Pipping-Gam\'on; Abraham J. Wyner

arXiv:2601.18774·stat.AP·April 3, 2026

Extreme-Path Benchmarks for Sequential Probability Forecasts

Jonathan Pipping-Gam\'on, Abraham J. Wyner

PDF

TL;DR

This paper introduces benchmark distributions for extreme-path functionals in sequential probability forecasts, enabling better diagnosis of miscalibration in real-time binary outcome predictions.

Contribution

It derives exact and finite-sample benchmark distributions for extreme forecast paths, providing model-agnostic null targets for calibration diagnostics.

Findings

01

Exact benchmark in continuous time for peak-on-loss

02

Finite-sample bounds with correction for discrete time

03

Application to ESPN sports data shows broad agreement and departures

Abstract

Real-time probability forecasts for binary outcomes are routine in sports, online experimentation, medicine, and finance. Retrospective narratives, however, often hinge on pathwise extremes: for example, a forecast that becomes "90% certain" for an event that ultimately does not occur. Standard pointwise calibration tools do not quantify how frequently such extremes should arise under correct sequential calibration, where the ideal forecast sequence is a bounded martingale that ends at the realized outcome. We derive benchmark distributions for extreme-path functionals conditional on the terminal outcome, emphasizing the peak-on-loss: the largest forecast value attained along realizations that end in failure. In continuous time with continuous paths we obtain an exact closed-form benchmark; in discrete time we prove sharp finite-sample bounds together with an explicit correction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.