TL;DR
This paper introduces a simple yet effective out-of-distribution detection method using an abstention class, trained with augmented data, outperforming complex existing approaches across various benchmarks.
Contribution
The paper proposes a straightforward abstention-based approach with an extra class, serving as a new strong baseline for out-of-distribution detection in deep learning.
Findings
Outperforms many complex methods on benchmarks
Effective across image and text classification tasks
Simple to implement and train
Abstract
Refraining from confidently predicting when faced with categories of inputs different from those seen during training is an important requirement for the safe deployment of deep learning systems. While simple to state, this has been a particularly challenging problem in deep learning, where models often end up making overconfident predictions in such situations. In this work we present a simple, but highly effective approach to deal with out-of-distribution detection that uses the principle of abstention: when encountering a sample from an unseen class, the desired behavior is to abstain from predicting. Our approach uses a network with an extra abstention class and is trained on a dataset that is augmented with an uncurated set that consists of a large number of out-of-distribution (OoD) samples that are assigned the label of the abstention class; the model is then trained to learn an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAverage Pooling · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Residual Connection · Batch Normalization · Gated Recurrent Unit · Wide Residual Block · Global Average Pooling · Kaiming Initialization
