TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking

Mengmeng Wang; Haonan Wang; Yulong Li; Xiangjie Kong; Jiaxin Du; Guojiang Shen; Feng Xia

arXiv:2507.19908·cs.CV·July 29, 2025

TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking

Mengmeng Wang, Haonan Wang, Yulong Li, Xiangjie Kong, Jiaxin Du, Guojiang Shen, Feng Xia

PDF

TL;DR

TrackAny3D introduces a novel framework that leverages pretrained 3D models with adapters and specialized architectures to achieve category-agnostic 3D point cloud tracking, outperforming existing methods.

Contribution

The paper presents the first approach to transfer large-scale pretrained 3D models for category-agnostic 3D SOT, using adapters, Mixture-of-Geometry-Experts, and temporal optimization strategies.

Findings

01

Achieves state-of-the-art results on three benchmarks.

02

Demonstrates strong generalization across object categories.

03

Outperforms category-specific tracking methods.

Abstract

3D LiDAR-based single object tracking (SOT) relies on sparse and irregular point clouds, posing challenges from geometric variations in scale, motion patterns, and structural complexity across object categories. Current category-specific approaches achieve good accuracy but are impractical for real-world use, requiring separate models for each category and showing limited generalization. To tackle these issues, we propose TrackAny3D, the first framework to transfer large-scale pretrained 3D models for category-agnostic 3D SOT. We first integrate parameter-efficient adapters to bridge the gap between pretraining and tracking tasks while preserving geometric priors. Then, we introduce a Mixture-of-Geometry-Experts (MoGE) architecture that adaptively activates specialized subnetworks based on distinct geometric characteristics. Additionally, we design a temporal context optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.