Shift-and-Balance Attention

Chunjie Luo; Jianfeng Zhan; Tianshu Hao; Lei Wang; Wanling Gao

arXiv:2103.13080·cs.CV·March 25, 2021·1 cites

Shift-and-Balance Attention

Chunjie Luo, Jianfeng Zhan, Tianshu Hao, Lei Wang, Wanling Gao

PDF

Open Access 1 Repo

TL;DR

This paper introduces Shift-and-Balance attention, a novel method that improves deep neural network performance by better regulating the contribution of attention mechanisms, outperforming existing methods like Squeeze-and-Excite and Dynamic Convolution.

Contribution

The paper proposes a new attention regulation technique called Shift-and-Balance, which enhances network accuracy by controlling the influence of attention branches more effectively.

Findings

01

Shift-and-Balance outperforms Squeeze-and-Excite in accuracy when applied in more layers.

02

It significantly improves network capacity and size.

03

Achieves comparable or better accuracy than state-of-the-art Dynamic Convolution.

Abstract

Attention is an effective mechanism to improve the deep model capability. Squeeze-and-Excite (SE) introduces a light-weight attention branch to enhance the network's representational power. The attention branch is gated using the Sigmoid function and multiplied by the feature map's trunk branch. It is too sensitive to coordinate and balance the trunk and attention branches' contributions. To control the attention branch's influence, we propose a new attention method, called Shift-and-Balance (SB). Different from Squeeze-and-Excite, the attention branch is regulated by the learned control factor to control the balance, then added into the feature map's trunk branch. Experiments show that Shift-and-Balance attention significantly improves the accuracy compared to Squeeze-and-Excite when applied in more layers, increasing more size and capacity of a network. Moreover, Shift-and-Balance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuranusduke/Shift_and_Balance_Attention
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition

MethodsConvolution