Decomposing Probabilistic Scores: Reliability, Information Loss and Uncertainty
Arthur Charpentier, Agathe Fernandes Machado

TL;DR
This paper introduces a decomposition framework for probabilistic scores that explicitly separates reliability, information loss, and residual uncertainty, providing new insights into calibration and model aggregation.
Contribution
It develops decomposition identities for proper losses, linking calibration, information gain, and uncertainty, with applications to recalibration and model combination.
Findings
Decomposition identities for proper losses elucidate calibration and uncertainty.
Chain decomposition quantifies information gain between feature levels.
Explicit forms for Brier and log-loss in model aggregation and recalibration.
Abstract
Calibration is a conditional property that depends on the information retained by a predictor. We develop decomposition identities for arbitrary proper losses that make this dependence explicit. At any information level , the expected loss of an -measurable predictor splits into a proper-regret (reliability) term and a conditional entropy (residual uncertainty) term. For nested levels , a chain decomposition quantifies the information gain from to . Applied to classification with features and score , this yields a three-term identity: miscalibration, a {\em grouping} term measuring information loss from to , and irreducible uncertainty at the feature level. We leverage the framework to analyze post-hoc recalibration, aggregation of calibrated models,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI)
