Self-Attentive Pooling for Efficient Deep Learning

Fang Chen; Gourav Datta; Souvik Kundu; Peter Beerel

arXiv:2209.07659·cs.CV·January 2, 2023·1 cites

Self-Attentive Pooling for Efficient Deep Learning

Fang Chen, Gourav Datta, Souvik Kundu, Peter Beerel

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel non-local self-attentive pooling method that improves feature aggregation in CNNs, leading to higher accuracy and reduced memory usage, enabling deployment on resource-constrained devices.

Contribution

We propose a self-attentive pooling technique that captures non-local dependencies, outperforming existing pooling methods in accuracy and efficiency for deep learning models.

Findings

01

Surpasses state-of-the-art pooling techniques in accuracy on ImageNet.

02

Achieves up to 22x reduction in memory consumption during inference.

03

Improves MobileNet-V2 performance by 1.2% on ImageNet.

Abstract

Efficient custom pooling techniques that can aggressively trim the dimensions of a feature map and thereby reduce inference compute and memory footprint for resource-constrained computer vision applications have recently gained significant traction. However, prior pooling works extract only the local context of the activation maps, limiting their effectiveness. In contrast, we propose a novel non-local self-attentive pooling method that can be used as a drop-in replacement to the standard pooling layers, such as max/average pooling or strided convolution. The proposed self-attention module uses patch embedding, multi-head self-attention, and spatial-channel restoration, followed by sigmoid activation and exponential soft-max. This self-attention mechanism efficiently aggregates dependencies between non-local activation patches during down-sampling. Extensive experiments on standard…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Self-Attentive Pooling for Efficient Deep Learning· youtube

Taxonomy

TopicsAdvanced Neural Network Applications · Image Enhancement Techniques · Domain Adaptation and Few-Shot Learning

MethodsPruning · Test · Sigmoid Activation