Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown
Zimeng Fang, Chao Liang, Xue Zhou, Shuyuan Zhu, and Xi Li

TL;DR
The paper introduces AED, a unified tracking framework that effectively handles both known and unknown object categories in multi-object tracking by relying solely on robust feature learning and similarity decoding, outperforming existing methods.
Contribution
AED is a novel unified framework that integrates CV-MOT and OV-MOT, eliminating the need for prior knowledge and improving tracking accuracy across diverse categories.
Findings
Achieves superior performance on TAO, SportsMOT, and DanceTrack datasets.
Effectively tracks unknown categories without prior knowledge.
Outperforms existing CV-MOT and OV-MOT methods.
Abstract
Multi-object tracking (MOT) emerges as a pivotal and highly promising branch in the field of computer vision. Classical closed-vocabulary MOT (CV-MOT) methods aim to track objects of predefined categories. Recently, some open-vocabulary MOT (OV-MOT) methods have successfully addressed the problem of tracking unknown categories. However, we found that the CV-MOT and OV-MOT methods each struggle to excel in the tasks of the other. In this paper, we present a unified framework, Associate Everything Detected (AED), that simultaneously tackles CV-MOT and OV-MOT by integrating with any off-the-shelf detector and supports unknown categories. Different from existing tracking-by-detection MOT methods, AED gets rid of prior knowledge (e.g. motion cues) and relies solely on highly robust feature learning to handle complex trajectories in OV-MOT tasks while keeping excellent performance in CV-MOT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
