Accurate Single Stage Detector Using Recurrent Rolling Convolution
Jimmy Ren, Xiaohao Chen, Jianbo Liu, Wenxiu Sun, Jiahao Pang, Qiong, Yan, Yu-Wing Tai, Li Xu

TL;DR
This paper introduces a novel single-stage object detection network using Recurrent Rolling Convolution to enhance multi-scale features, achieving state-of-the-art results on the KITTI dataset for high IoU thresholds.
Contribution
The paper proposes Recurrent Rolling Convolution architecture for single-stage detection, significantly improving accuracy over previous methods on challenging benchmarks.
Findings
Outperformed all previous single-stage methods on KITTI dataset
Achieved first place in car and cyclist detection, second in pedestrian detection
Model based on reduced VGG-16 outperforms existing benchmarks
Abstract
Most of the recent successful methods in accurate object detection and localization used some variants of R-CNN style two stage Convolutional Neural Networks (CNN) where plausible regions were proposed in the first stage then followed by a second stage for decision refinement. Despite the simplicity of training and the efficiency in deployment, the single stage detection methods have not been as competitive when evaluated in benchmarks consider mAP for high IoU thresholds. In this paper, we proposed a novel single stage end-to-end trainable object detection network to overcome this limitation. We achieved this by introducing Recurrent Rolling Convolution (RRC) architecture over multi-scale feature maps to construct object classifiers and bounding box regressors which are "deep in context". We evaluated our method in the challenging KITTI dataset which measures methods under IoU…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Video Surveillance and Tracking Methods
MethodsRegion Proposal Network · RoIPool · Faster R-CNN · Dropout · *Communicated@Fast*How Do I Communicate to Expedia? · Dense Connections · Softmax · Ethereum Customer Service Number +1-833-534-1729 · Max Pooling · Convolution
