Information Bottleneck Approach to Spatial Attention Learning
Qiuxia Lai, Yu Li, Ailing Zeng, Minhao Liu, Hanqiu Sun and, Qiang Xu

TL;DR
This paper introduces an information bottleneck-inspired spatial attention module for deep neural networks that enhances interpretability and focus on relevant image regions by balancing information compression and task accuracy.
Contribution
It proposes a novel IB-based attention mechanism with quantized scores, improving interpretability and performance in visual recognition tasks.
Findings
Attention maps highlight relevant regions and suppress backgrounds.
The method improves accuracy in image classification and recognition tasks.
Attention maps are interpretable and align with human visual focus.
Abstract
The selective visual attention mechanism in the human visual system (HVS) restricts the amount of information to reach visual awareness for perceiving natural scenes, allowing near real-time information processing with limited computational capacity [Koch and Ullman, 1987]. This kind of selectivity acts as an 'Information Bottleneck (IB)', which seeks a trade-off between information compression and predictive accuracy. However, such information constraints are rarely explored in the attention mechanism for deep neural networks (DNNs). In this paper, we propose an IB-inspired spatial attention module for DNN structures built for visual recognition. The module takes as input an intermediate representation of the input image, and outputs a variational 2D attention map that minimizes the mutual information (MI) between the attention-modulated representation and the input, while maximizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Visual Attention and Saliency Detection
MethodsMax Pooling · Convolution · Average Pooling · Sigmoid Activation
