Are Out-of-Distribution Detection Methods Effective on Large-Scale Datasets?
Ryne Roady, Tyler L. Hayes, Ronald Kemker, Ayesha Gonzales, and, Christopher Kanan

TL;DR
This paper evaluates out-of-distribution detection methods on large-scale datasets like ImageNet, finding that input perturbation and temperature scaling are most effective, while feature space regularization has limited practicality at scale.
Contribution
It systematically compares inference and regularization approaches for out-of-distribution detection on large datasets, identifying optimal combinations and practical insights.
Findings
Input perturbation and temperature scaling perform best on large datasets.
Feature space regularization can help if a suitable background class is available.
Regularization against a background class is impractical for large-scale datasets.
Abstract
Supervised classification methods often assume the train and test data distributions are the same and that all classes in the test set are present in the training set. However, deployed classifiers often require the ability to recognize inputs from outside the training set as unknowns. This problem has been studied under multiple paradigms including out-of-distribution detection and open set recognition. For convolutional neural networks, there have been two major approaches: 1) inference methods to separate knowns from unknowns and 2) feature space regularization strategies to improve model robustness to outlier inputs. There has been little effort to explore the relationship between the two approaches and directly compare performance on anything other than small-scale datasets that have at most 100 categories. Using ImageNet-1K and Places-434, we identify novel combinations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
