NOAH: Learning Pairwise Object Category Attentions for Image Classification
Chao Li, Aojun Zhou, Anbang Yao

TL;DR
NOAH introduces a novel attention mechanism called pairwise object category attention (POCA) that enhances feature encoding in DNNs, leading to significant performance improvements across various image classification architectures and datasets.
Contribution
The paper proposes NOAH, a plug-and-play head that utilizes POCA to exploit spatially dense category-specific attentions, improving classification accuracy without sacrificing efficiency.
Findings
Significant accuracy improvements on ImageNet across diverse architectures.
Enhanced performance on multi-label image classification benchmarks.
Effective in both lightweight and large-scale DNNs.
Abstract
A modern deep neural network (DNN) for image classification tasks typically consists of two parts: a backbone for feature extraction, and a head for feature encoding and class predication. We observe that the head structures of mainstream DNNs adopt a similar feature encoding pipeline, exploiting global feature dependencies while disregarding local ones. In this paper, we revisit the feature encoding problem, and propose Non-glObal Attentive Head (NOAH) that relies on a new form of dot-product attention called pairwise object category attention (POCA), efficiently exploiting spatially dense category-specific attentions to augment classification performance. NOAH introduces a neat combination of feature split, transform and merge operations to learn POCAs at local to global scales. As a drop-in design, NOAH can be easily used to replace existing heads of various types of DNNs, improving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · COVID-19 diagnosis using AI · Advanced Image and Video Retrieval Techniques
MethodsBatch Normalization · Pointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Average Pooling · Inverted Residual Block · Convolution · 1x1 Convolution
