Enhanced Kalman with Adaptive Appearance Motion SORT for Grounded Generic Multiple Object Tracking
Duy Le Dinh Anh, Kim Hoang Tran, Quang-Thuc Nguyen, Ngan Hoang Le

TL;DR
This paper introduces Grounded-GMOT, a new paradigm for multi-object tracking using natural language, along with the G2MOT dataset and the KAM-SORT tracker, which effectively combines appearance and motion cues for generic object tracking.
Contribution
It presents the G2MOT dataset, a novel tracking method KAM-SORT that enhances Kalman filtering, and demonstrates improved performance over existing GMOT approaches.
Findings
Grounded-GMOT outperforms existing OneShot-GMOT methods.
KAM-SORT effectively integrates appearance and motion cues.
The G2MOT dataset provides diverse videos with textual descriptions.
Abstract
Despite recent progress, Multi-Object Tracking (MOT) continues to face significant challenges, particularly its dependence on prior knowledge and predefined categories, complicating the tracking of unfamiliar objects. Generic Multiple Object Tracking (GMOT) emerges as a promising solution, requiring less prior information. Nevertheless, existing GMOT methods, primarily designed as OneShot-GMOT, rely heavily on initial bounding boxes and often struggle with variations in viewpoint, lighting, occlusion, and scale. To overcome the limitations inherent in both MOT and GMOT when it comes to tracking objects with specific generic attributes, we introduce Grounded-GMOT, an innovative tracking paradigm that enables users to track multiple generic objects in videos through natural language descriptors. Our contributions begin with the introduction of the G2MOT dataset, which includes a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems · Robotics and Sensor-Based Localization · Inertial Sensor and Navigation
