A Systematic Analysis of Out-of-Distribution Detection Under Representation and Training Paradigm Shifts
Claudio C\'esar Claros Olivares, Austin J. Brockmeier

TL;DR
This paper systematically benchmarks out-of-distribution detection methods across various models, datasets, and training paradigms, revealing the importance of learned representations and proposing new analysis tools and improvements.
Contribution
It introduces a comprehensive benchmark framework, analyzes representation effects on detection, and proposes PCA filtering and NC-based prediction methods for OOD detection.
Findings
Simple probabilistic scores outperform complex ones across models.
Margin-based scores excel in near-OOD regimes, while geometry-aware scores perform better with severe shifts.
Representation collapse correlates with the effectiveness of boundary-aware scores.
Abstract
We present a systematic benchmark of out-of-distribution (OOD) detection CSFs through a representation-centric lens. Our study spans CNN and ViT backbones, multiple training paradigms, four image-classification source datasets (CIFAR-10, CIFAR-100, SuperCIFAR-100, and TinyImageNet), and OOD datasets grouped into near, mid, and far regimes using CLIP-derived semantic distances. To compare CSFs across these settings, we employ a multiple-comparison-controlled rank pipeline that identifies top cliques of statistically indistinguishable winners under threshold-free ranking metrics (AURC and AUGRC). The main empirical finding is that the competitive detector family depends more on the learned representation than on score design alone. For both CNNs and ViTs, simple probabilistic scores dominate misclassification detection. On CNNs, margin-based scores are strongest in near-OOD regimes, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
