On the Impact of Spurious Correlation for Out-of-distribution Detection
Yifei Ming, Hang Yin, Yixuan Li

TL;DR
This paper formalizes the impact of spurious correlations on out-of-distribution detection in neural networks, revealing that increased spurious feature-label correlation worsens detection performance and providing insights for more robust methods.
Contribution
It introduces a new formalization of OOD considering invariant and spurious features and analyzes how spurious correlations affect detection performance.
Findings
Higher spurious correlation worsens OOD detection accuracy.
Detection methods can be improved by reducing reliance on environmental features.
Theoretical analysis explains why environmental features lead to high detection error.
Abstract
Modern neural networks can assign high confidence to inputs drawn from outside the training distribution, posing threats to models in real-world deployments. While much research attention has been placed on designing new out-of-distribution (OOD) detection methods, the precise definition of OOD is often left in vagueness and falls short of the desired notion of OOD in reality. In this paper, we present a new formalization and model the data shifts by taking into account both the invariant and environmental (spurious) features. Under such formalization, we systematically investigate how spurious correlation in the training set impacts OOD detection. Our results suggest that the detection performance is severely worsened when the correlation between spurious features and labels is increased in the training set. We further show insights on detection methods that are more effective in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Fire Detection and Safety Systems
