Feature Selective Networks for Object Detection
Yao Zhai, Jingjing Fu, Yan Lu, Houqiang Li

TL;DR
This paper introduces feature selective networks that enhance object detection by exploiting sub-region and aspect ratio disparities in RoI features, leading to significant performance improvements across multiple datasets.
Contribution
The proposed network reformulates RoI features using sub-region and aspect ratio attention banks, improving detection accuracy with minimal additional complexity.
Findings
Over 3% mAP improvement on PASCAL VOC and MS COCO datasets.
Effective enhancement across various backbone networks like ResNet-101, GoogLeNet, and VGG-16.
Simple integration yields consistent detection performance boosts.
Abstract
Objects for detection usually have distinct characteristics in different sub-regions and different aspect ratios. However, in prevalent two-stage object detection methods, Region-of-Interest (RoI) features are extracted by RoI pooling with little emphasis on these translation-variant feature components. We present feature selective networks to reform the feature representations of RoIs by exploiting their disparities among sub-regions and aspect ratios. Our network produces the sub-region attention bank and aspect ratio attention bank for the whole image. The RoI-based sub-region attention map and aspect ratio attention map are selectively pooled from the banks, and then used to refine the original RoI features for RoI classification. Equipped with a light-weight detection subnetwork, our network gets a consistent boost in detection performance based on general ConvNet backbones…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
Methods1x1 Convolution · Convolution · Average Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling
