Ensemble size effects on conditional reliability estimates: slope attenuation bias and correction methods
Jonas Spaeth, Christopher D. Roberts

TL;DR
This paper investigates how finite ensemble sizes bias conditional reliability diagnostics in ensemble forecasting, deriving correction methods and illustrating their importance with temperature forecast data.
Contribution
It introduces a unified framework for understanding and correcting slope attenuation bias caused by finite ensemble sampling noise in reliability assessments.
Findings
Finite ensemble sizes systematically bias conditional reliability diagnostics.
Analytical expressions for attenuation are derived and practical estimators are proposed.
Finite-ensemble effects significantly impact spread-error relationships and reliability diagrams.
Abstract
The goal of ensemble forecasting is to maximise sharpness subject to reliability. Marginal reliability means that, over all cases, the ensemble is statistically consistent with reality: the ensemble mean is unbiased, the expected ensemble variance equals the expected mean-squared error of the ensemble mean, and the variance of the ensemble members matches the variance of the truth. Equivalently, forecasts that assign probability to an event verify with relative frequency . However, climatological consistency is not sufficient for users acting on individual forecasts. A natural extension is to assess reliability conditional on the forecast itself, by examining whether, on average, larger ensemble means imply larger observed values, larger spreads imply larger forecast errors, or higher probabilities imply higher event frequencies. This motivates conditional reliability diagnostics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
