STEAM: Squeeze and Transform Enhanced Attention Module
Rishabh Sabharwal, Ram Samarth B B, Parikshit Singh Rathore, Punit, Rathore

TL;DR
This paper introduces STEAM, a novel graph-based attention module that models both channel and spatial attention efficiently, improving CNN performance with minimal additional computational cost.
Contribution
The paper proposes the first graph-based approach to jointly model channel and spatial attention with constant parameters, including a new pooling method for spatial context.
Findings
Achieves 2% higher accuracy on image classification with minimal GFLOPs increase.
Outperforms ECA and GCT modules in accuracy while reducing GFLOPs threefold.
Effective for object detection and segmentation tasks.
Abstract
Channel and spatial attention mechanisms introduced by earlier works enhance the representation abilities of deep convolutional neural networks (CNNs) but often lead to increased parameter and computation costs. While recent approaches focus solely on efficient feature context modeling for channel attention, we aim to model both channel and spatial attention comprehensively with minimal parameters and reduced computation. Leveraging the principles of relational modeling in graphs, we introduce a constant-parameter module, STEAM: Squeeze and Transform Enhanced Attention Module, which integrates channel and spatial attention to enhance the representation power of CNNs. To our knowledge, we are the first to propose a graph-based approach for modeling both channel and spatial attention, utilizing concepts from multi-head graph transformers. Additionally, we introduce Output Guided Pooling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiation Effects in Electronics · Advanced Neural Network Applications · Neural Networks and Applications
MethodsSoftmax · Attention Is All You Need · Gated Channel Transformation · Focus
