TL;DR
This paper demonstrates that a simple, well-tuned range-view approach can achieve state-of-the-art 3D object detection results on lidar data, challenging the need for complex techniques.
Contribution
The authors develop a new range-view 3D object detection model that surpasses existing methods without relying on complex multi-resolution or IoU-based losses.
Findings
Range-view input feature dimensionality impacts performance
Classification loss based on 3D spatial proximity is highly effective
Range subsampling outperforms multi-resolution range-conditioned networks
Abstract
Lidar-based perception pipelines rely on 3D object detection models to interpret complex scenes. While multiple representations for lidar exist, the range-view is enticing since it losslessly encodes the entire lidar sensor output. In this work, we achieve state-of-the-art amongst range-view 3D object detection models without using multiple techniques proposed in past range-view literature. We explore range-view 3D object detection across two modern datasets with substantially different properties: Argoverse 2 and Waymo Open. Our investigation reveals key insights: (1) input feature dimensionality significantly influences the overall performance, (2) surprisingly, employing a classification loss grounded in 3D spatial proximity works as well or better compared to more elaborate IoU-based losses, and (3) addressing non-uniform lidar density via a straightforward range subsampling…
Peer Reviews
Decision·CoRL 2024
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
