Auto-Context R-CNN
Bo Li, Tianfu Wu, Lun Zhang, Rufeng Chu

TL;DR
Auto-Context R-CNN enhances object detection by integrating adaptive contextual information around RoIs, leading to improved accuracy and robustness, especially for small objects and occlusions, through a novel context-mining operator.
Contribution
The paper introduces RoICtxMining, a simple two-layer extension of RoIPooling/RoIAlign, enabling effective context integration in R-CNNs for improved detection performance.
Findings
Achieves 6.9% mAP improvement on COCO test-dev
First place in KITTI pedestrian and cyclist detection
Robust to occlusion and small objects
Abstract
Region-based convolutional neural networks (R-CNN)~\cite{fast_rcnn,faster_rcnn,mask_rcnn} have largely dominated object detection. Operators defined on RoIs (Region of Interests) play an important role in R-CNNs such as RoIPooling~\cite{fast_rcnn} and RoIAlign~\cite{mask_rcnn}. They all only utilize information inside RoIs for RoI prediction, even with their recent deformable extensions~\cite{deformable_cnn}. Although surrounding context is well-known for its importance in object detection, it has yet been integrated in R-CNNs in a flexible and effective way. Inspired by the auto-context work~\cite{auto_context} and the multi-class object layout work~\cite{nms_context}, this paper presents a generic context-mining RoI operator (i.e., \textit{RoICtxMining}) seamlessly integrated in R-CNNs, and the resulting object detection system is termed \textbf{Auto-Context R-CNN} which is trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
MethodsRoIAlign
