Super Sparse 3D Object Detection
Lue Fan, Yuxue Yang, Feng Wang, Naiyan Wang, and Zhaoxiang Zhang

TL;DR
This paper introduces a fully sparse 3D object detection framework, FSD and FSD++, that efficiently handles long-range perception in autonomous driving by leveraging sparse features and temporal information to reduce computational costs.
Contribution
The paper proposes a novel fully sparse detection architecture and a super sparse variant that effectively reduces data redundancy for long-range 3D detection.
Findings
Achieves state-of-the-art performance on Waymo Open Dataset.
Demonstrates superior long-range detection on Argoverse 2 Dataset.
Reduces computational overhead significantly with super sparse data.
Abstract
As the perception range of LiDAR expands, LiDAR-based 3D object detection contributes ever-increasingly to the long-range perception in autonomous driving. Mainstream 3D object detectors often build dense feature maps, where the cost is quadratic to the perception range, making them hardly scale up to the long-range settings. To enable efficient long-range detection, we first propose a fully sparse object detector termed FSD. FSD is built upon the general sparse voxel encoder and a novel sparse instance recognition (SIR) module. SIR groups the points into instances and applies highly-efficient instance-wise feature extraction. The instance-wise grouping sidesteps the issue of the center feature missing, which hinders the design of the fully sparse architecture. To further enjoy the benefit of fully sparse characteristic, we leverage temporal information to remove data redundancy and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Robotics and Sensor-Based Localization
