Understanding the Feature Norm for Out-of-Distribution Detection
Jaewoo Park, Jacky Chen Long Chai, Jaeho Yoon, Andrew Beng Jin Teoh

TL;DR
This paper investigates why neural network feature norms differ between in-distribution and out-of-distribution samples, revealing their relation to classifier confidence and proposing a new norm to improve OOD detection.
Contribution
The study uncovers the link between feature norm and classifier confidence, introduces a class-agnostic norm, and proposes a novel negative-aware norm for better OOD detection.
Findings
Feature norm correlates with classifier confidence (max logit).
Feature norm can detect OOD samples across models.
Negative-aware norm improves OOD detection performance.
Abstract
A neural network trained on a classification dataset often exhibits a higher vector norm of hidden layer features for in-distribution (ID) samples, while producing relatively lower norm values on unseen instances from out-of-distribution (OOD). Despite this intriguing phenomenon being utilized in many applications, the underlying cause has not been thoroughly investigated. In this study, we demystify this very phenomenon by scrutinizing the discriminative structures concealed in the intermediate layers of a neural network. Our analysis leads to the following discoveries: (1) The feature norm is a confidence value of a classifier hidden in the network layer, specifically its maximum logit. Hence, the feature norm distinguishes OOD from ID in the same manner that a classifier confidence does. (2) The feature norm is class-agnostic, thus it can detect OOD samples across diverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Understanding the Feature Norm for Out-of-Distribution Detection· youtube
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
