An Effective Baseline for Robustness to Distributional Shift

Sunil Thulasidasan; Sushil Thapa; Sayera Dhaubhadel; Gopinath; Chennupati; Tanmoy Bhattacharya; Jeff Bilmes

arXiv:2105.07107·cs.LG·May 18, 2021

An Effective Baseline for Robustness to Distributional Shift

Sunil Thulasidasan, Sushil Thapa, Sayera Dhaubhadel, Gopinath, Chennupati, Tanmoy Bhattacharya, Jeff Bilmes

PDF

1 Repo

TL;DR

This paper introduces a simple yet effective out-of-distribution detection method using an abstention class, trained with augmented data, outperforming complex existing approaches across various benchmarks.

Contribution

The paper proposes a straightforward abstention-based approach with an extra class, serving as a new strong baseline for out-of-distribution detection in deep learning.

Findings

01

Outperforms many complex methods on benchmarks

02

Effective across image and text classification tasks

03

Simple to implement and train

Abstract

Refraining from confidently predicting when faced with categories of inputs different from those seen during training is an important requirement for the safe deployment of deep learning systems. While simple to state, this has been a particularly challenging problem in deep learning, where models often end up making overconfident predictions in such situations. In this work we present a simple, but highly effective approach to deal with out-of-distribution detection that uses the principle of abstention: when encountering a sample from an unseen class, the desired behavior is to abstain from predicting. Our approach uses a network with an extra abstention class and is trained on a dataset that is augmented with an uncurated set that consists of a large number of out-of-distribution (OoD) samples that are assigned the label of the abstention class; the model is then trained to learn an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Sushil-Thapa/Abstention-OoD
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAverage Pooling · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Residual Connection · Batch Normalization · Gated Recurrent Unit · Wide Residual Block · Global Average Pooling · Kaiming Initialization