A Baseline for Detecting Misclassified and Out-of-Distribution Examples   in Neural Networks

Dan Hendrycks; Kevin Gimpel

arXiv:1610.02136·cs.NE·October 4, 2018·1.6k cites

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

Dan Hendrycks, Kevin Gimpel

PDF

Open Access 5 Repos

TL;DR

This paper introduces a simple softmax-based baseline for detecting misclassified and out-of-distribution examples across various AI tasks, demonstrating its effectiveness and highlighting potential for future improvements.

Contribution

The paper proposes a straightforward softmax probability-based method for identifying misclassified and OOD samples, providing a baseline for future research in this area.

Findings

01

Effective detection across vision, NLP, and speech tasks

02

Baseline can be surpassed with advanced methods

03

Room for future research in detection tasks

Abstract

We consider the two related problems of detecting if an example is misclassified or out-of-distribution. We present a simple baseline that utilizes probabilities from softmax distributions. Correctly classified examples tend to have greater maximum softmax probabilities than erroneously classified and out-of-distribution examples, allowing for their detection. We assess performance by defining several tasks in computer vision, natural language processing, and automatic speech recognition, showing the effectiveness of this baseline across all. We then show the baseline can sometimes be surpassed, demonstrating the room for future research on these underexplored detection tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Anomaly Detection Techniques and Applications

MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling