TL;DR
This paper introduces a training-free, multi-layer feature aggregation method for out-of-distribution detection that improves robustness across various neural network architectures.
Contribution
It proposes a simple, model-agnostic approach leveraging internal multi-layer representations and class prototypes for enhanced OOD detection performance.
Findings
Improves AUROC by up to 4.41% on OOD benchmarks.
Reduces FPR by 13.58%, demonstrating better OOD detection.
Effective across diverse architectures and datasets.
Abstract
Deep learning models are increasingly deployed in safety-critical applications, where reliable out-of-distribution (OOD) detection is essential to ensure robustness. Existing methods predominantly rely on the penultimate-layer activations of neural networks, assuming they encapsulate the most informative in-distribution (ID) representations. In this work, we revisit this assumption to show that intermediate layers encode equally rich and discriminative information for OOD detection. Based on this observation, we propose a simple yet effective model-agnostic approach that leverages internal representations across multiple layers. Our scheme aggregates features from successive convolutional blocks, computes class-wise mean embeddings, and applies L_2 normalization to form compact ID prototypes capturing class semantics. During inference, cosine similarity between test features and these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
