RangeRCNN: Towards Fast and Accurate 3D Object Detection with Range Image Representation
Zhidong Liang, Ming Zhang, Zehan Zhang, Xian Zhao, Shiliang Pu

TL;DR
RangeRCNN introduces a novel 3D object detection framework leveraging range image representation, combining dense 2D convolution with scale-adaptive modules and a two-stage RCNN, achieving state-of-the-art results efficiently.
Contribution
The paper proposes a new range image-based 3D detection method with a scale-adaptive module and a two-stage RCNN, improving efficiency and accuracy over voxel and point-based methods.
Findings
Achieves state-of-the-art performance on KITTI and Waymo datasets.
Demonstrates real-time detection capabilities.
Introduces effective data augmentation for range images.
Abstract
We present RangeRCNN, a novel and effective 3D object detection framework based on the range image representation. Most existing methods are voxel-based or point-based. Though several optimizations have been introduced to ease the sparsity issue and speed up the running time, the two representations are still computationally inefficient. Compared to them, the range image representation is dense and compact which can exploit powerful 2D convolution. Even so, the range image is not preferred in 3D object detection due to scale variation and occlusion. In this paper, we utilize the dilated residual block (DRB) to better adapt different object scales and obtain a more flexible receptive field. Considering scale variation and occlusion, we propose the RV-PV-BEV (range view-point view-bird's eye view) module to transfer features from RV to BEV. The anchor is defined in BEV which avoids scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Advanced Vision and Imaging
MethodsBatch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · Residual Block · Convolution
