TL;DR
TBDQ-Net is an efficient end-to-end multi-object tracking framework that combines tracking-by-detection and tracking-by-query paradigms, achieving high accuracy and speed with a lightweight design.
Contribution
The paper introduces TBDQ-Net, a novel framework that synergizes TBD and TBQ, with modules addressing occlusion and alignment, reducing parameters and inference time.
Findings
Outperforms leading TBD methods by 6.0 IDF1 on DanceTrack.
Achieves best TBQ performance on MOT20 benchmark.
Reduces trainable parameters by ~80% and speeds up inference by 37.5%.
Abstract
Multi-object tracking (MOT) is primarily dominated by two paradigms: tracking-by-detection (TBD) and tracking-by-query (TBQ). While TBD offers modular efficiency, its fragmented association pipeline often limits robustness in complex scenarios. Conversely, TBQ enhances semantic modeling end-to-end but suffers from high training costs and slow inference due to the tight coupling of detection and association. In this work, we propose the tracking-by-detection-and-query framework, TBDQ-Net, to advance the synergy between TBD and TBQ paradigms. By integrating a frozen detector with a lightweight associator, this architecture ensures intrinsic efficiency. Within this streamlined framework, we introduce tailored designs to address MOT-specific challenges. Concretely, we alleviate task conflicts and occlusions through the dual-stream update of the Basic Information Interaction (BII) module.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
