One Graph to Track Them All: Dynamic GNNs for Single- and Multi-View Tracking

Martin Engilberge; Ivan Vrkic; Friedrich Wilke Grosche; Julien Pilet; Engin Turetken; and Pascal Fua

arXiv:2507.08494·cs.CV·January 1, 2026

One Graph to Track Them All: Dynamic GNNs for Single- and Multi-View Tracking

Martin Engilberge, Ivan Vrkic, Friedrich Wilke Grosche, Julien Pilet, Engin Turetken, and Pascal Fua

PDF

TL;DR

This paper introduces a unified, differentiable dynamic graph model for multi-people tracking that effectively handles occlusions and diverse scenes, outperforming existing methods on multiple benchmarks.

Contribution

It proposes a novel fully differentiable dynamic graph approach for multi-people tracking, eliminating the need for pre-computed tracklets and incorporating scene-specific information.

Findings

01

Achieves state-of-the-art results on public benchmarks.

02

Introduces a new large-scale multi-view dataset with complex occlusions.

03

Demonstrates flexibility across various tracking conditions.

Abstract

This work presents a unified, fully differentiable model for multi-people tracking that learns to associate detections into trajectories without relying on pre-computed tracklets. The model builds a dynamic spatiotemporal graph that aggregates spatial, contextual, and temporal information, enabling seamless information propagation across entire sequences. To improve occlusion handling, the graph can also encode scene-specific information. We also introduce a new large-scale dataset with 25 partially overlapping views, detailed scene reconstructions, and extensive occlusions. Experiments show the model achieves state-of-the-art performance on public benchmarks and the new dataset, with flexibility across diverse conditions. Both the dataset and approach will be publicly released to advance research in multi-people tracking.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.