Assessing the conditional calibration of interval forecasts using decompositions of the interval score
Sam Allen, Julia Burnello, Johanna Ziegel

TL;DR
This paper introduces a novel method to assess the conditional calibration of interval forecasts by decomposing the interval score, enhancing the evaluation of probabilistic predictions beyond traditional unconditional coverage checks.
Contribution
It proposes a decomposition of the interval score using isotonic distributional regression to accurately evaluate the conditional calibration of interval forecasts.
Findings
Decomposition method effectively assesses conditional calibration.
Method performs well on simulated and real datasets.
Provides a practical tool for forecast evaluation.
Abstract
Forecasts for uncertain future events should be probabilistic. Probabilistic forecasts are commonly issued as prediction intervals, which provide a measure of uncertainty in the unknown outcome whilst being easier to understand and communicate than full predictive distributions. The calibration of a -level prediction interval can be assessed by checking whether the probability that the outcome falls within the interval is equal to . However, such coverage checks are typically unconditional and therefore relatively weak. Although this is well known, there is a lack of methods to assess the conditional calibration of interval forecasts. In this work, we demonstrate how this can be achieved via decompositions of the well-known interval (or Winkler) score. We study notions of calibration for interval forecasts and then introduce a decomposition of the interval…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
