DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds
Tao Ma, Xuemeng Yang, Hongbin Zhou, Xin Li, Botian Shi, Junjie Liu,, Yuchen Yang, Zhizheng Liu, Liang He, Yu Qiao, Yikang Li, Hongsheng Li

TL;DR
DetZero introduces a novel offboard 3D detection paradigm that leverages long-term sequential point clouds with an offline tracker and attention-based refinement, achieving state-of-the-art results on Waymo dataset.
Contribution
The paper proposes a new offboard 3D detection framework with an offline tracker and attention mechanism, significantly improving trajectory completeness and long-term context utilization.
Findings
Outperforms existing methods on Waymo dataset
Achieves 85.15 mAPH (L2) detection performance
Validates high-quality detection as a substitute for human labels
Abstract
Existing offboard 3D detectors always follow a modular pipeline design to take advantage of unlimited sequential point clouds. We have found that the full potential of offboard 3D detectors is not explored mainly due to two reasons: (1) the onboard multi-object tracker cannot generate sufficient complete object trajectories, and (2) the motion state of objects poses an inevitable challenge for the object-centric refining stage in leveraging the long-term temporal context representation. To tackle these problems, we propose a novel paradigm of offboard 3D object detection, named DetZero. Concretely, an offline tracker coupled with a multi-frame detector is proposed to focus on the completeness of generated object tracks. An attention-mechanism refining module is proposed to strengthen contextual information interaction across long-term sequential point clouds for object refining with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Human Pose and Action Recognition
MethodsFocus
