Out-of-Distribution Detection Methods Answer the Wrong Questions
Yucen Lily Li, Daohan Lu, Polina Kirichenko, Shikai Qiu, Tim G. J. Rudner, C. Bayan Bruss, Andrew Gordon Wilson

TL;DR
This paper critically examines popular out-of-distribution detection methods, revealing they fundamentally answer the wrong questions and are limited by inherent misalignments in their objectives, leading to irreducible errors.
Contribution
It provides a critical analysis of existing OOD detection methods, highlighting their fundamental misalignments and limitations, and discusses why common interventions fail to resolve these issues.
Findings
Uncertainty-based methods conflate high uncertainty with OOD detection.
Feature-based methods conflate large feature-space distance with OOD.
Interventions like feature-logit hybrids and outlier exposure do not fix core misalignments.
Abstract
To detect distribution shifts and improve model safety, many out-of-distribution (OOD) detection methods rely on the predictive uncertainty or features of supervised models trained on in-distribution data. In this paper, we critically re-examine this popular family of OOD detection procedures, and we argue that these methods are fundamentally answering the wrong questions for OOD detection. There is no simple fix to this misalignment, since a classifier trained only on in-distribution classes cannot be expected to identify OOD points; for instance, a cat-dog classifier may confidently misclassify an airplane if it contains features that distinguish cats from dogs, despite generally appearing nothing alike. We find that uncertainty-based methods incorrectly conflate high uncertainty with being OOD, while feature-based methods incorrectly conflate far feature-space distance with being…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Data Stream Mining Techniques
