SA-Net: Shuffle Attention for Deep Convolutional Neural Networks
Qing-Long Zhang Yu-Bin Yang

TL;DR
SA-Net introduces an efficient shuffle attention module that combines spatial and channel attention mechanisms using shuffle units, significantly improving neural network performance with lower computational costs across multiple vision tasks.
Contribution
The paper proposes a novel Shuffle Attention (SA) module that effectively fuses spatial and channel attention with reduced computational overhead, enhancing deep neural network performance.
Findings
SA achieves over 1.34% top-1 accuracy improvement on ImageNet-1k.
SA reduces parameters and FLOPs compared to baseline models.
Experimental results show SA outperforms state-of-the-art methods on multiple benchmarks.
Abstract
Attention mechanisms, which enable a neural network to accurately focus on all the relevant elements of the input, have become an essential component to improve the performance of deep neural networks. There are mainly two attention mechanisms widely used in computer vision studies, \textit{spatial attention} and \textit{channel attention}, which aim to capture the pixel-level pairwise relationship and channel dependency, respectively. Although fusing them together may achieve better performance than their individual implementations, it will inevitably increase the computational overhead. In this paper, we propose an efficient Shuffle Attention (SA) module to address this issue, which adopts Shuffle Units to combine two types of attention mechanisms effectively. Specifically, SA first groups channel dimensions into multiple sub-features before processing them in parallel. Then, for each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
