DRTAM: Dual Rank-1 Tensor Attention Module
Hanxing Chi, Baihong Lin, Jun Hu, Liang Wang

TL;DR
DRTAM introduces a dual rank-1 tensor attention module that enhances feature refinement in CNNs by capturing local and long-range context, improving performance across large and mobile networks.
Contribution
It proposes a novel dual attention mechanism using rank-1 tensor attention and residual descriptors, advancing attention modules for CNNs with superior adaptability and effectiveness.
Findings
Achieves competitive accuracy on ImageNet-1K, MS COCO, and PASCAL VOC.
Effective on both large and mobile networks.
Outperforms existing attention modules in various benchmarks.
Abstract
Recently, attention mechanisms have been extensively investigated in computer vision, but few of them show excellent performance on both large and mobile networks. This paper proposes Dual Rank-1 Tensor Attention Module (DRTAM), a novel residual-attention-learning-guided attention module for feed-forward convolutional neural networks. Given a 3D feature tensor map, DRTAM firstly generates three 2D feature descriptors along three axes. Then, using three descriptors, DRTAM sequentially infers two rank-1 tensor attention maps, the initial attention map and the complement attention map, combines and multiplied them to the input feature map for adaptive feature refinement(see Fig.1(c)). To generate two attention maps, DRTAM introduces rank-1 tensor attention module (RTAM) and residual descriptors extraction module (RDEM): RTAM divides each 2D feature descriptors into several chunks, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Advanced Image and Video Retrieval Techniques
MethodsStrip Pooling
