TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNs
Shantanu Jaiswal, Basura Fernando, Cheston Tan

TL;DR
This paper introduces TDAM, a lightweight top-down attention module for CNNs that incorporates higher-level contextual information to improve feature selection, leading to better performance and interpretability in object recognition tasks.
Contribution
The paper proposes a novel top-down attention module (TDAM) that enhances CNNs by integrating semantic context and enabling attention shifting without supervision.
Findings
TDAM outperforms existing attention modules on multiple benchmarks.
TDAM improves ResNet50's weakly-supervised object localization by 5%.
TDAM is more parameter- and memory-efficient than comparable modules.
Abstract
Attention modules for Convolutional Neural Networks (CNNs) are an effective method to enhance performance on multiple computer-vision tasks. While existing methods appropriately model channel-, spatial- and self-attention, they primarily operate in a feedforward bottom-up manner. Consequently, the attention mechanism strongly depends on the local information of a single input feature map and does not incorporate relatively semantically-richer contextual information available at higher layers that can specify "what and where to look" in lower-level feature maps through top-down information flow. Accordingly, in this work, we propose a lightweight top-down attention module (TDAM) that iteratively generates a "visual searchlight" to perform channel and spatial modulation of its inputs and outputs more contextually-relevant feature maps at each computation step. Our experiments indicate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Brain Tumor Detection and Classification
MethodsBatch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Dense Connections · Convolution · Communication--Guide||How Do I Communicate to Expedia? · Max Pooling · Average Pooling · Sigmoid Activation · How do i ask a question at Expedia?*AskExpertService · 1x1 Convolution
