To Trust Or Not To Trust A Classifier

Heinrich Jiang; Been Kim; Melody Y. Guan; Maya Gupta

arXiv:1805.11783·stat.ML·October 30, 2018·185 cites

To Trust Or Not To Trust A Classifier

Heinrich Jiang, Been Kim, Melody Y. Guan, Maya Gupta

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel trust score for classifiers that outperforms traditional confidence scores in identifying correct and incorrect predictions, with theoretical guarantees under certain data assumptions.

Contribution

The paper proposes the trust score, an alternative measure to confidence scores, with empirical and theoretical evidence of its effectiveness and consistency in classification tasks.

Findings

01

Trust score better identifies correct classifications than confidence scores.

02

High trust scores correlate with classifier agreement and correctness.

03

Theoretical guarantees under mild assumptions support trust score reliability.

Abstract

Knowing when a classifier's prediction can be trusted is useful in many applications and critical for safely using AI. While the bulk of the effort in machine learning research has been towards improving classifier performance, understanding when a classifier's predictions should and should not be trusted has received far less attention. The standard approach is to use the classifier's discriminant or confidence score; however, we show there exists an alternative that is more effective in many situations. We propose a new score, called the trust score, which measures the agreement between the classifier and a modified nearest-neighbor classifier on the testing example. We show empirically that high (low) trust scores produce surprisingly high precision at identifying correctly (incorrectly) classified examples, consistently outperforming the classifier's confidence score as well as many…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google/TrustScore
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis · Statistical Methods and Inference