ULSAM: Ultra-Lightweight Subspace Attention Module for Compact   Convolutional Neural Networks

Rajat Saini; Nandan Kumar Jha; Bedanta Das; Sparsh Mittal; C. Krishna; Mohan

arXiv:2006.15102·cs.CV·June 29, 2020

ULSAM: Ultra-Lightweight Subspace Attention Module for Compact Convolutional Neural Networks

Rajat Saini, Nandan Kumar Jha, Bedanta Das, Sparsh Mittal, C. Krishna, Mohan

PDF

1 Repo

TL;DR

ULSAM introduces a lightweight subspace attention module that enhances compact CNNs by enabling multi-scale feature representation, significantly reducing computational costs while improving accuracy on image classification tasks.

Contribution

The paper presents the first subspace attention mechanism for compact CNNs, improving efficiency and accuracy with minimal overhead.

Findings

01

13% FLOPs reduction in MobileNet-V2

02

25% parameter reduction in MobileNet-V2

03

Accuracy improvement of over 1% on ImageNet-1K

Abstract

The capability of the self-attention mechanism to model the long-range dependencies has catapulted its deployment in vision models. Unlike convolution operators, self-attention offers infinite receptive field and enables compute-efficient modeling of global dependencies. However, the existing state-of-the-art attention mechanisms incur high compute and/or parameter overheads, and hence unfit for compact convolutional neural networks (CNNs). In this work, we propose a simple yet effective "Ultra-Lightweight Subspace Attention Mechanism" (ULSAM), which infers different attention maps for each feature map subspace. We argue that leaning separate attention maps for each feature subspace enables multi-scale and multi-frequency feature representation, which is more desirable for fine-grained image classification. Our method of subspace attention is orthogonal and complementary to the existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Nandan91/ULSAM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution