GLT-T: Global-Local Transformer Voting for 3D Single Object Tracking in   Point Clouds

Jiahao Nie; Zhiwei He; Yuxiang Yang; Mingyu Gao; Jing Zhang

arXiv:2211.10927·cs.CV·November 22, 2022·1 cites

GLT-T: Global-Local Transformer Voting for 3D Single Object Tracking in Point Clouds

Jiahao Nie, Zhiwei He, Yuxiang Yang, Mingyu Gao, Jing Zhang

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces GLT-T, a novel 3D object tracking method that employs a global-local transformer voting scheme to generate high-quality proposals by enhancing seed point features and importance weighting, achieving state-of-the-art results.

Contribution

The paper proposes a global-local transformer module and an importance prediction strategy to improve seed point feature representation and voting in 3D object tracking.

Findings

01

Achieves state-of-the-art performance on KITTI and NuScenes benchmarks.

02

Demonstrates the effectiveness of the global-local transformer voting scheme.

03

Shows improved proposal quality over VoteNet through ablation studies.

Abstract

Current 3D single object tracking methods are typically based on VoteNet, a 3D region proposal network. Despite the success, using a single seed point feature as the cue for offset learning in VoteNet prevents high-quality 3D proposals from being generated. Moreover, seed points with different importance are treated equally in the voting process, aggravating this defect. To address these issues, we propose a novel global-local transformer voting scheme to provide more informative cues and guide the model pay more attention on potential seed points, promoting the generation of high-quality 3D proposals. Technically, a global-local transformer (GLT) module is employed to integrate object- and patch-aware prior into seed point features to effectively form strong feature representation for geometric positions of the seed points, thus providing more robust and accurate cues for offset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

GLT-T: Global-Local Transformer Voting for 3D Single Object Tracking in Point Clouds· underline

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Face recognition and analysis · Human Pose and Action Recognition