GMT: Effective Global Framework for Multi-Camera Multi-Target Tracking
Yihao Zhen, Mingyue Xu, Qiang Wang, Baojie Fan, Jiahua Dong, Tinghui Zhao, Huijie Fan

TL;DR
This paper introduces GMT, a unified global framework for multi-camera multi-target tracking that jointly exploits intra-view and inter-view cues, significantly improving tracking accuracy over existing two-stage methods.
Contribution
The paper proposes GMT, a novel global MCMT tracking framework with a Cross-View Feature Consistency Enhancement module and a Global Trajectory Association module, advancing beyond traditional two-stage approaches.
Findings
Achieves up to 21.3% improvement in CVMA
Achieves up to 17.2% improvement in CVIDF1
Introduces the VisionTrack dataset with greater diversity
Abstract
Multi-Camera Multi-Target (MCMT) tracking aims to locate and associate the same targets across multiple camera views. Existing methods typically adopt a two-stage framework, involving single-camera tracking followed by inter-camera tracking. However, in this paradigm, multi-view information is used only to recover missed matches in the first stage, providing a limited contribution to overall tracking. To address this issue, we propose GMT, a global MCMT tracking framework that jointly exploits intra-view and inter-view cues for tracking. Specifically, instead of assigning trajectories independently for each view, we integrate the same historical targets across different views as global trajectories, thereby reformulating the two-stage tracking as a unified global-level trajectory-target association process. We introduce a Cross-View Feature Consistency Enhancement (CFCE) module to align…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies
