3D Siamese Voxel-to-BEV Tracker for Sparse Point Clouds
Le Hui, Lingpeng Wang, Mingmei Cheng, Jin Xie, Jian Yang

TL;DR
This paper introduces a novel 3D Siamese voxel-to-BEV tracking method that enhances sparse point cloud tracking by combining shape-aware feature learning with dense BEV localization, significantly outperforming existing methods.
Contribution
The paper proposes a new Siamese voxel-to-BEV tracker with shape-aware features and dense BEV localization, improving 3D object tracking in sparse point clouds.
Findings
Outperforms state-of-the-art on KITTI and nuScenes datasets
Effectively captures 3D shape information for better discrimination
Achieves significant accuracy improvements in sparse environments
Abstract
3D object tracking in point clouds is still a challenging problem due to the sparsity of LiDAR points in dynamic environments. In this work, we propose a Siamese voxel-to-BEV tracker, which can significantly improve the tracking performance in sparse 3D point clouds. Specifically, it consists of a Siamese shape-aware feature learning network and a voxel-to-BEV target localization network. The Siamese shape-aware feature learning network can capture 3D shape information of the object to learn the discriminative features of the object so that the potential target from the background in sparse point clouds can be identified. To this end, we first perform template feature embedding to embed the template's feature into the potential target and then generate a dense 3D shape to characterize the shape information of the potential target. For localizing the tracked target, the voxel-to-BEV…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsVideo Surveillance and Tracking Methods · 3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage
MethodsMax Pooling
