Stereo R-CNN based 3D Object Detection for Autonomous Driving
Peiliang Li, Xiaozhi Chen, and Shaojie Shen

TL;DR
This paper introduces Stereo R-CNN, a novel stereo image-based 3D object detection method for autonomous driving that outperforms existing methods without requiring depth or 3D supervision.
Contribution
It extends Faster R-CNN for stereo inputs, integrating keypoint, viewpoint, and dimension prediction to improve 3D detection accuracy without depth supervision.
Findings
Outperforms state-of-the-art stereo-based methods by 30% AP on KITTI.
Does not require depth input or 3D supervision.
Achieves superior 3D detection and localization results.
Abstract
We propose a 3D object detection method for autonomous driving by fully exploiting the sparse and dense, semantic and geometry information in stereo imagery. Our method, called Stereo R-CNN, extends Faster R-CNN for stereo inputs to simultaneously detect and associate object in left and right images. We add extra branches after stereo Region Proposal Network (RPN) to predict sparse keypoints, viewpoints, and object dimensions, which are combined with 2D left-right boxes to calculate a coarse 3D object bounding box. We then recover the accurate 3D bounding box by a region-based photometric alignment using left and right RoIs. Our method does not require depth input and 3D position supervision, however, outperforms all existing fully supervised image-based methods. Experiments on the challenging KITTI dataset show that our method outperforms the state-of-the-art stereo-based method by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Vision and Imaging
MethodsRegion Proposal Network · Softmax · Convolution · RoIPool · Faster R-CNN
