On the Calibration of Probabilistic Classifier Sets
Thomas Mortier, Viktor Bengs, Eyke H\"ullermeier, Stijn Luca, and Willem Waegeman

TL;DR
This paper extends calibration concepts to sets of probabilistic classifiers, proposing a new test to evaluate their epistemic uncertainty, and finds that deep neural network ensembles often lack proper calibration.
Contribution
It introduces a novel calibration notion for classifier sets and develops a nonparametric test to assess their epistemic uncertainty calibration.
Findings
Ensembles of deep neural networks are often poorly calibrated.
The proposed test effectively evaluates calibration of classifier sets.
The method generalizes existing calibration tests for single classifiers.
Abstract
Multi-class classification methods that produce sets of probabilistic classifiers, such as ensemble learning methods, are able to model aleatoric and epistemic uncertainty. Aleatoric uncertainty is then typically quantified via the Bayes error, and epistemic uncertainty via the size of the set. In this paper, we extend the notion of calibration, which is commonly used to evaluate the validity of the aleatoric uncertainty representation of a single probabilistic classifier, to assess the validity of an epistemic uncertainty representation obtained by sets of probabilistic classifiers. Broadly speaking, we call a set of probabilistic classifiers calibrated if one can find a calibrated convex combination of these classifiers. To evaluate this notion of calibration, we propose a novel nonparametric calibration test that generalizes an existing test for single probabilistic classifiers to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)
