Out-of-Distribution Detection Methods Answer the Wrong Questions

Yucen Lily Li; Daohan Lu; Polina Kirichenko; Shikai Qiu; Tim G. J. Rudner; C. Bayan Bruss; Andrew Gordon Wilson

arXiv:2507.01831·cs.LG·July 3, 2025

Out-of-Distribution Detection Methods Answer the Wrong Questions

Yucen Lily Li, Daohan Lu, Polina Kirichenko, Shikai Qiu, Tim G. J. Rudner, C. Bayan Bruss, Andrew Gordon Wilson

PDF

Open Access

TL;DR

This paper critically examines popular out-of-distribution detection methods, revealing they fundamentally answer the wrong questions and are limited by inherent misalignments in their objectives, leading to irreducible errors.

Contribution

It provides a critical analysis of existing OOD detection methods, highlighting their fundamental misalignments and limitations, and discusses why common interventions fail to resolve these issues.

Findings

01

Uncertainty-based methods conflate high uncertainty with OOD detection.

02

Feature-based methods conflate large feature-space distance with OOD.

03

Interventions like feature-logit hybrids and outlier exposure do not fix core misalignments.

Abstract

To detect distribution shifts and improve model safety, many out-of-distribution (OOD) detection methods rely on the predictive uncertainty or features of supervised models trained on in-distribution data. In this paper, we critically re-examine this popular family of OOD detection procedures, and we argue that these methods are fundamentally answering the wrong questions for OOD detection. There is no simple fix to this misalignment, since a classifier trained only on in-distribution classes cannot be expected to identify OOD points; for instance, a cat-dog classifier may confidently misclassify an airplane if it contains features that distinguish cats from dogs, despite generally appearing nothing alike. We find that uncertainty-based methods incorrectly conflate high uncertainty with being OOD, while feature-based methods incorrectly conflate far feature-space distance with being…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Data Stream Mining Techniques