From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection
Jiajun Deng, Wengang Zhou, Yanyong Zhang, and Houqiang Li

TL;DR
This paper introduces H^2 3D R-CNN, a novel architecture for 3D object detection from LiDAR point clouds that effectively fuses multi-view features and hallucinated 3D representations, outperforming state-of-the-art methods.
Contribution
The paper proposes a new framework that combines multi-view feature extraction with a bilaterally guided fusion and hierarchical voxel pooling for improved 3D detection.
Findings
Outperforms state-of-the-art on KITTI and Waymo datasets
Efficiently fuses perspective and bird-eye view features
Demonstrates superior effectiveness and efficiency
Abstract
As an emerging data modal with precise distance sensing, LiDAR point clouds have been placed great expectations on 3D scene understanding. However, point clouds are always sparsely distributed in the 3D space, and with unstructured storage, which makes it difficult to represent them for effective 3D object detection. To this end, in this work, we regard point clouds as hollow-3D data and propose a new architecture, namely Hallucinated Hollow-3D R-CNN (3D R-CNN), to address the problem of 3D object detection. In our approach, we first extract the multi-view features by sequentially projecting the point clouds into the perspective view and the bird-eye view. Then, we hallucinate the 3D representation by a novel bilaterally guided multi-view fusion block. Finally, the 3D objects are detected via a box refinement module with a novel Hierarchical Voxel RoI Pooling operation. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Human Pose and Action Recognition
MethodseToro Customer Care Number +1-833-534-1729 · Max Pooling · Voxel RoI Pooling
