Object Detection Networks on Convolutional Feature Maps
Shaoqing Ren, Kaiming He, Ross Girshick, Xiangyu Zhang, Jian Sun

TL;DR
This paper emphasizes the importance of designing deep, convolutional per-region classifiers for object detection, demonstrating that such classifiers significantly improve detection accuracy beyond just using advanced feature extractors.
Contribution
It introduces and evaluates Networks on Convolutional feature maps (NoCs), highlighting their critical role in achieving top detection performance, especially when combined with modern deep feature extractors.
Findings
Deep per-region classifiers improve detection accuracy.
NoCs contributed to winning entries in major challenges.
Advanced feature extractors alone are insufficient for optimal detection.
Abstract
Most object detectors contain two important components: a feature extractor and an object classifier. The feature extractor has rapidly evolved with significant research efforts leading to better deep convolutional architectures. The object classifier, however, has not received much attention and many recent systems (like SPPnet and Fast/Faster R-CNN) use simple multi-layer perceptrons. This paper demonstrates that carefully designing deep networks for object classification is just as important. We experiment with region-wise classifier networks that use shared, region-independent convolutional features. We call them "Networks on Convolutional feature maps" (NoCs). We discover that aside from deep feature maps, a deep and convolutional per-region classifier is of particular importance for object detection, whereas latest superior image classification models (such as ResNets and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Average Pooling · 1x1 Convolution · Region Proposal Network · Softmax · RoIPool · Faster R-CNN · Global Average Pooling · Bottleneck Residual Block
