Activation Subspaces for Out-of-Distribution Detection

Bar{\i}\c{s} Z\"ong\"ur; Robin Hesse; Stefan Roth

arXiv:2508.21695·cs.LG·September 1, 2025

Activation Subspaces for Out-of-Distribution Detection

Bar{\i}\c{s} Z\"ong\"ur, Robin Hesse, Stefan Roth

PDF

Open Access

TL;DR

This paper introduces ActSub, a novel OOD detection method that leverages singular value decomposition of model weights to identify decisive and insignificant activation subspaces, improving detection especially under large distribution shifts.

Contribution

The paper proposes a new OOD detection approach using activation subspace decomposition, achieving state-of-the-art results across multiple benchmarks.

Findings

01

Insignificant subspace better detects Far-OOD data.

02

Decisive subspace enhances Near-OOD detection.

03

ActSub outperforms existing methods on standard benchmarks.

Abstract

To ensure the reliability of deep models in real-world applications, out-of-distribution (OOD) detection methods aim to distinguish samples close to the training distribution (in-distribution, ID) from those farther away (OOD). In this work, we propose a novel OOD detection method that utilizes singular value decomposition of the weight matrix of the classification head to decompose the model's activations into decisive and insignificant components, which contribute maximally, respectively minimally, to the final classifier output. We find that the subspace of insignificant components more effectively distinguishes ID from OOD data than raw activations in regimes of large distribution shifts (Far-OOD). This occurs because the classification objective leaves the insignificant subspace largely unaffected, yielding features that are ''untainted'' by the target classification task.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications