Spatial Group-wise Enhance: Improving Semantic Feature Learning in Convolutional Networks
Xiang Li, Xiaolin Hu, Jian Yang

TL;DR
The paper introduces a lightweight Spatial Group-wise Enhance (SGE) module that improves semantic feature learning in CNNs by adaptively emphasizing relevant sub-features, leading to significant accuracy gains in image recognition tasks.
Contribution
The novel SGE module enhances semantic feature learning by spatially adjusting sub-feature importance without adding extra parameters, improving CNN performance.
Findings
Achieves 1.2% Top-1 accuracy improvement on ImageNet with ResNet50.
Provides 1.0-2.0% AP gain across various detectors on COCO.
Lightweight design with minimal computational overhead.
Abstract
The Convolutional Neural Networks (CNNs) generate the feature representation of complex objects by collecting hierarchical and different parts of semantic sub-features. These sub-features can usually be distributed in grouped form in the feature vector of each layer, representing various semantic entities. However, the activation of these sub-features is often spatially affected by similar patterns and noisy backgrounds, resulting in erroneous localization and identification. We propose a Spatial Group-wise Enhance (SGE) module that can adjust the importance of each sub-feature by generating an attention factor for each spatial location in each semantic group, so that every individual group can autonomously enhance its learnt expression and suppress possible noise. The attention factors are only guided by the similarities between the global and local feature descriptors inside each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsAverage Pooling · Focal Loss · Six Ways To Communicate To Someone At Expedia Via Phone And Email's. · Feature Pyramid Network · RetinaNet · Cascade R-CNN · RoIAlign · Mask R-CNN · Region Proposal Network · Softmax
