Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking
Yan Gao, Haojun Xu, Nannan Wang, Jie Li, Xinbo Gao

TL;DR
This paper introduces CoNo-Link, a graph neural network framework that models ultra-long frame information for multi-object tracking, improving long-term association accuracy with low storage overhead.
Contribution
It proposes a novel multi-scene generalized graph model using composite nodes for better long-term object association in tracking.
Findings
Outperforms state-of-the-art methods on standard datasets.
Effectively models ultra-long frame information with low storage.
Enhances feature representation by treating trajectories as nodes.
Abstract
The global multi-object tracking (MOT) system can consider interaction, occlusion, and other ``visual blur'' scenarios to ensure effective object tracking in long videos. Among them, graph-based tracking-by-detection paradigms achieve surprising performance. However, their fully-connected nature poses storage space requirements that challenge algorithm handling long videos. Currently, commonly used methods are still generated trajectories by building one-forward associations across frames. Such matches produced under the guidance of first-order similarity information may not be optimal from a longer-time perspective. Moreover, they often lack an end-to-end scheme for correcting mismatches. This paper proposes the Composite Node Message Passing Network (CoNo-Link), a multi-scene generalized framework for modeling ultra-long frames information for association. CoNo-Link's solution is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Video Analysis and Summarization · Advanced Computing and Algorithms
