Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking

Ayesha Ishaq; Mohamed El Amine Boudjoghra; Jean Lahoud; Fahad Shahbaz; Khan; Salman Khan; Hisham Cholakkal; Rao Muhammad Anwer

arXiv:2410.01678·cs.CV·February 28, 2025

Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking

Ayesha Ishaq, Mohamed El Amine Boudjoghra, Jean Lahoud, Fahad Shahbaz, Khan, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer

PDF

Open Access 1 Repo

TL;DR

This paper introduces the first open-vocabulary 3D multi-object tracking framework that enhances adaptability to unseen objects in autonomous driving scenarios, addressing limitations of traditional category-constrained systems.

Contribution

It formulates open-vocabulary 3D tracking, creates new dataset splits, and proposes a novel method that generalizes tracking to unseen object classes in real-world environments.

Findings

01

Demonstrates robustness in diverse outdoor scenarios

02

Reduces performance gap between known and unseen objects

03

Provides publicly available code and datasets

Abstract

3D multi-object tracking plays a critical role in autonomous driving by enabling the real-time monitoring and prediction of multiple objects' movements. Traditional 3D tracking systems are typically constrained by predefined object categories, limiting their adaptability to novel, unseen objects in dynamic environments. To address this limitation, we introduce open-vocabulary 3D tracking, which extends the scope of 3D tracking to include objects beyond predefined categories. We formulate the problem of open-vocabulary 3D tracking and introduce dataset splits designed to represent various open-vocabulary scenarios. We propose a novel approach that integrates open-vocabulary capabilities into a 3D tracking framework, allowing for generalization to unseen object classes. Our method effectively reduces the performance gap between tracking known and novel objects through strategic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ayesha-ishaq/open3dtrack
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization