Scoring Rules and Calibration for Imprecise Probabilities

Christian Fr\"ohlich; Robert C. Williamson

arXiv:2410.23001·cs.LG·October 31, 2024

Scoring Rules and Calibration for Imprecise Probabilities

Christian Fr\"ohlich, Robert C. Williamson

PDF

Open Access 1 Repo

TL;DR

This paper extends the concepts of proper scoring rules and calibration from precise to imprecise probabilistic forecasts, providing a theoretical framework and practical insights for decision-making under uncertainty.

Contribution

It generalizes scoring rules and calibration for imprecise probabilities, linking them to distributional robustness and highlighting their distinct roles.

Findings

01

Proper scoring rules and calibration are not necessarily aligned in the imprecise case.

02

The concept of decision-theoretic entropy is central to both scoring and calibration.

03

Illustrates pitfalls in loss function choices in machine learning distributional robustness.

Abstract

What does it mean to say that, for example, the probability for rain tomorrow is between 20% and 30%? The theory for the evaluation of precise probabilistic forecasts is well-developed and is grounded in the key concepts of proper scoring rules and calibration. For the case of imprecise probabilistic forecasts (sets of probabilities), such theory is still lacking. In this work, we therefore generalize proper scoring rules and calibration to the imprecise case. We develop these concepts as relative to data models and decision problems. As a consequence, the imprecision is embedded in a clear context. We establish a close link to the paradigm of (group) distributional robustness and in doing so provide new insights for it. We argue that proper scoring rules and calibration serve two distinct goals, which are aligned in the precise case, but intriguingly are not necessarily aligned in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

froec/ip_scoring_rules_experiments
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference