Useful Confidence Measures: Beyond the Max Score

Gal Yona; Amir Feder; Itay Laish

arXiv:2210.14070·cs.LG·October 26, 2022

Useful Confidence Measures: Beyond the Max Score

Gal Yona, Amir Feder, Itay Laish

PDF

Open Access

TL;DR

This paper explores confidence measures beyond the maximum score in ML classifiers, demonstrating that entropy-based measures provide more reliable confidence estimates, especially for NLP models under distribution shifts.

Contribution

It introduces and empirically evaluates confidence measures that utilize information beyond the max score, highlighting the effectiveness of entropy-based confidence in NLP tasks.

Findings

01

Max score confidence is suboptimal for out-of-distribution detection

02

Entropy-based confidence measures outperform max score in various settings

03

Post-processing improves confidence estimates but does not eliminate the benefits of entropy-based measures

Abstract

An important component in deploying machine learning (ML) in safety-critic applications is having a reliable measure of confidence in the ML model's predictions. For a classifier $f$ producing a probability vector $f (x)$ over the candidate classes, the confidence is typically taken to be $max_{i} f (x)_{i}$ . This approach is potentially limited, as it disregards the rest of the probability vector. In this work, we derive several confidence measures that depend on information beyond the maximum score, such as margin-based and entropy-based measures, and empirically evaluate their usefulness, focusing on NLP tasks with distribution shifts and Transformer-based models. We show that when models are evaluated on the out-of-distribution data ``out of the box'', using only the maximum score to inform the confidence measure is highly suboptimal. In the post-processing regime (where the scores of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification