P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for   LiDAR Point Clouds

Jiahao Nie; Fei Xie; Sifan Zhou; Xueyi Zhou; Dong-Kyu Chae; Zhiwei He

arXiv:2407.05238·cs.CV·April 24, 2025

P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds

Jiahao Nie, Fei Xie, Sifan Zhou, Xueyi Zhou, Dong-Kyu Chae, Zhiwei He

PDF

Open Access 1 Repo

TL;DR

This paper introduces P2P, a novel part-to-part motion modeling framework for LiDAR point cloud tracking, which leverages detailed motion cues to improve accuracy and speed over existing methods.

Contribution

The paper proposes a new part-to-part motion modeling framework for LiDAR point cloud tracking, enabling more accurate and efficient target motion inference.

Findings

01

P2P-voxel achieves state-of-the-art accuracy (~89%) on KITTI.

02

P2P-point outperforms previous motion trackers by 3.3% and 6.7%.

03

P2P runs at 107 FPS on a single GPU.

Abstract

3D single object tracking (SOT) methods based on appearance matching has long suffered from insufficient appearance information incurred by incomplete, textureless and semantically deficient LiDAR point clouds. While motion paradigm exploits motion cues instead of appearance matching for tracking, it incurs complex multi-stage processing and segmentation module. In this paper, we first provide in-depth explorations on motion paradigm, which proves that (\textbf{i}) it is feasible to directly infer target relative motion from point clouds across consecutive frames; (\textbf{ii}) fine-grained information comparison between consecutive point clouds facilitates target motion modeling. We thereby propose to perform part-to-part motion modeling for consecutive point clouds and introduce a novel tracking framework, termed \textbf{P2P}. The novel framework fuses each corresponding part…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haooozi/P2P
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optical Sensing Technologies · Advanced Neural Network Applications · Robotics and Sensor-Based Localization

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings