DRTAM: Dual Rank-1 Tensor Attention Module

Hanxing Chi; Baihong Lin; Jun Hu; Liang Wang

arXiv:2203.05893·cs.CV·September 8, 2023

DRTAM: Dual Rank-1 Tensor Attention Module

Hanxing Chi, Baihong Lin, Jun Hu, Liang Wang

PDF

Open Access

TL;DR

DRTAM introduces a dual rank-1 tensor attention module that enhances feature refinement in CNNs by capturing local and long-range context, improving performance across large and mobile networks.

Contribution

It proposes a novel dual attention mechanism using rank-1 tensor attention and residual descriptors, advancing attention modules for CNNs with superior adaptability and effectiveness.

Findings

01

Achieves competitive accuracy on ImageNet-1K, MS COCO, and PASCAL VOC.

02

Effective on both large and mobile networks.

03

Outperforms existing attention modules in various benchmarks.

Abstract

Recently, attention mechanisms have been extensively investigated in computer vision, but few of them show excellent performance on both large and mobile networks. This paper proposes Dual Rank-1 Tensor Attention Module (DRTAM), a novel residual-attention-learning-guided attention module for feed-forward convolutional neural networks. Given a 3D feature tensor map, DRTAM firstly generates three 2D feature descriptors along three axes. Then, using three descriptors, DRTAM sequentially infers two rank-1 tensor attention maps, the initial attention map and the complement attention map, combines and multiplied them to the input feature map for adaptive feature refinement(see Fig.1(c)). To generate two attention maps, DRTAM introduces rank-1 tensor attention module (RTAM) and residual descriptors extraction module (RDEM): RTAM divides each 2D feature descriptors into several chunks, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Advanced Image and Video Retrieval Techniques

MethodsStrip Pooling