Fully Sparse 3D Object Detection
Lue Fan, Feng Wang, Naiyan Wang, Zhaoxiang Zhang

TL;DR
This paper introduces FSD, a fully sparse 3D object detector for LiDAR data that significantly reduces computational costs and improves long-range detection performance in autonomous driving.
Contribution
The paper proposes a novel fully sparse architecture with a sparse instance recognition module, enabling efficient long-range 3D detection without dense feature maps.
Findings
FSD achieves state-of-the-art results on Waymo and Argoverse 2 datasets.
FSD is 2.4 times faster than dense detectors at large perception ranges.
FSD maintains linear computational complexity relative to the number of points.
Abstract
As the perception range of LiDAR increases, LiDAR-based 3D object detection becomes a dominant task in the long-range perception task of autonomous driving. The mainstream 3D object detectors usually build dense feature maps in the network backbone and prediction head. However, the computational and spatial costs on the dense feature map are quadratic to the perception range, which makes them hardly scale up to the long-range setting. To enable efficient long-range LiDAR-based object detection, we build a fully sparse 3D object detector (FSD). The computational and spatial cost of FSD is roughly linear to the number of points and independent of the perception range. FSD is built upon the general sparse voxel encoder and a novel sparse instance recognition (SIR) module. SIR first groups the points into instances and then applies instance-wise feature extraction and prediction. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
