Self-Attentive Pooling for Efficient Deep Learning
Fang Chen, Gourav Datta, Souvik Kundu, Peter Beerel

TL;DR
This paper introduces a novel non-local self-attentive pooling method that improves feature aggregation in CNNs, leading to higher accuracy and reduced memory usage, enabling deployment on resource-constrained devices.
Contribution
We propose a self-attentive pooling technique that captures non-local dependencies, outperforming existing pooling methods in accuracy and efficiency for deep learning models.
Findings
Surpasses state-of-the-art pooling techniques in accuracy on ImageNet.
Achieves up to 22x reduction in memory consumption during inference.
Improves MobileNet-V2 performance by 1.2% on ImageNet.
Abstract
Efficient custom pooling techniques that can aggressively trim the dimensions of a feature map and thereby reduce inference compute and memory footprint for resource-constrained computer vision applications have recently gained significant traction. However, prior pooling works extract only the local context of the activation maps, limiting their effectiveness. In contrast, we propose a novel non-local self-attentive pooling method that can be used as a drop-in replacement to the standard pooling layers, such as max/average pooling or strided convolution. The proposed self-attention module uses patch embedding, multi-head self-attention, and spatial-channel restoration, followed by sigmoid activation and exponential soft-max. This self-attention mechanism efficiently aggregates dependencies between non-local activation patches during down-sampling. Extensive experiments on standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Self-Attentive Pooling for Efficient Deep Learning· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Image Enhancement Techniques · Domain Adaptation and Few-Shot Learning
MethodsPruning · Test · Sigmoid Activation
