Beyond temperature scaling: Obtaining well-calibrated multiclass   probabilities with Dirichlet calibration

Meelis Kull; Miquel Perello-Nieto; Markus K\"angsepp; Telmo Silva; Filho; Hao Song; Peter Flach

arXiv:1910.12656·cs.LG·October 29, 2019·163 cites

Beyond temperature scaling: Obtaining well-calibrated multiclass probabilities with Dirichlet calibration

Meelis Kull, Miquel Perello-Nieto, Markus K\"angsepp, Telmo Silva, Filho, Hao Song, Peter Flach

PDF

Open Access 3 Repos

TL;DR

This paper introduces Dirichlet calibration, a new multiclass calibration method that improves probability estimates across various models and datasets by generalizing binary beta calibration to the multiclass setting.

Contribution

It presents a natively multiclass calibration technique based on Dirichlet distributions, extending beta calibration, and demonstrates its effectiveness over existing methods.

Findings

01

Improved calibration metrics across multiple datasets and classifiers.

02

Easy implementation with neural networks via log-transform and linear layer.

03

Provides insights into model biases through Dirichlet parameters.

Abstract

Class probabilities predicted by most multiclass classifiers are uncalibrated, often tending towards over-confidence. With neural networks, calibration can be improved by temperature scaling, a method to learn a single corrective multiplicative factor for inputs to the last softmax layer. On non-neural models the existing methods apply binary calibration in a pairwise or one-vs-rest fashion. We propose a natively multiclass calibration method applicable to classifiers from any model class, derived from Dirichlet distributions and generalising the beta calibration method from binary classification. It is easily implemented with neural nets since it is equivalent to log-transforming the uncalibrated probabilities, followed by one linear layer and softmax. Experiments demonstrate improved probabilistic predictions according to multiple measures (confidence-ECE, classwise-ECE, log-loss,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Neural Networks and Applications · Explainable Artificial Intelligence (XAI)

MethodsLinear Layer · Softmax