Spatial Information Bottleneck for Interpretable Visual Recognition

Kaixiang Shu; Kai Meng; Junqin Luo

arXiv:2511.09239·cs.CV·November 13, 2025

Open Access

TL;DR

This paper introduces Spatial Information Bottleneck (S-IB), a method that improves neural network interpretability and robustness by spatially disentangling class-relevant information from background noise, leading to better explanations and accuracy.

Contribution

The paper presents a novel information-theoretic framework and a spatial disentanglement method (S-IB) that enhances interpretability and robustness of neural networks.

Findings

1. Improved visualization quality across multiple explanation methods.

2. Enhanced foreground focus and background suppression in explanations.

3. Consistent accuracy improvements on five benchmarks.

Abstract

Deep neural networks typically learn spatially entangled representations that conflate discriminative foreground features with spurious background correlations, thereby undermining model interpretability and robustness. We propose a new framework for understanding gradient-based attribution from an information-theoretic perspective. We prove that, under mild conditions, the Vector-Jacobian Products (VJP) computed during backpropagation form minimal sufficient statistics of input features with respect to class labels. Motivated by this finding, we adopt an encoding-decoding perspective: forward propagation encodes inputs into class space, while the VJP in backpropagation decodes this encoding back to feature space. Building on this view, we introduce the Spatial Information Bottleneck (S-IB) to spatially disentangle information flow. By maximizing mutual information between foreground VJP and inputs…
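To make the encoding-decoding reading concrete, here is a minimal sketch of a vector-Jacobian product on a toy model. It is not the authors' implementation: the one-layer `tanh` network, the weights `W`, the input `x`, the helper names `forward` and `vjp`, and the foreground `mask` are all assumptions chosen for illustration. The forward pass maps four input features to two class scores, and the VJP pulls a one-hot class vector back to feature space, giving one attribution value per input feature.

```python
import math

def forward(W, x):
    """Encode: map input features x to class scores tanh(W x)."""
    return [math.tanh(sum(w * xj for w, xj in zip(row, x))) for row in W]

def vjp(W, x, v):
    """Decode: return v^T J, with J the Jacobian of forward at x.

    J is never built explicitly; the cotangent v is pulled back
    through each operation in reverse order, as backpropagation does.
    """
    y = forward(W, x)
    # Pull back through tanh: d tanh(z)/dz = 1 - tanh(z)^2 (elementwise).
    g = [vi * (1.0 - yi * yi) for vi, yi in zip(v, y)]
    # Pull back through the linear map: multiply by W transposed.
    return [sum(g[i] * W[i][j] for i in range(len(W))) for j in range(len(x))]

# Hypothetical toy weights (2 classes, 4 input features) and input.
W = [[0.5, -0.2, 0.0, 0.1],
     [0.0, 0.3, -0.4, 0.2]]
x = [1.0, 2.0, -1.0, 0.5]
v = [1.0, 0.0]  # one-hot cotangent selecting class 0

# One attribution value per input feature (the gradient of the class-0 score).
attribution = vjp(W, x, v)

# Hypothetical foreground mask: features 0 and 1 are "foreground".
mask = [1, 1, 0, 0]
fg = [a * m for a, m in zip(attribution, mask)]        # foreground part of the VJP
bg = [a * (1 - m) for a, m in zip(attribution, mask)]  # background part of the VJP
```

With a one-hot `v`, the VJP equals the gradient of the selected class score with respect to the input, which is the quantity gradient-based attribution methods visualize. The split into `fg` and `bg` only illustrates what a spatial separation of the VJP could look like; how S-IB defines and optimizes these terms is given in the paper, not here.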

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis