Graph Neural Networks for Cross-Camera Data Association
Elena Luna, Juan C. SanMiguel, Jos\'e M. Mart\'inez, and Pablo, Carballeira

TL;DR
This paper introduces a novel graph neural network approach for cross-camera data association that learns similarity measures and provides a global solution, outperforming traditional methods on pedestrian datasets.
Contribution
It proposes a global, learnable GNN-based method for cross-camera data association, avoiding pairwise processing and fixed distance metrics.
Findings
Outperforms existing data association techniques on EPFL pedestrian dataset.
Does not require training on the specific scenario it is tested in.
Provides an efficient, scalable solution for multi-camera data association.
Abstract
Cross-camera image data association is essential for many multi-camera computer vision tasks, such as multi-camera pedestrian detection, multi-camera multi-target tracking, 3D pose estimation, etc. This association task is typically stated as a bipartite graph matching problem and often solved by applying minimum-cost flow techniques, which may be computationally inefficient with large data. Furthermore, cameras are usually treated by pairs, obtaining local solutions, rather than finding a global solution at once. Other key issue is that of the affinity measurement: the widespread usage of non-learnable pre-defined distances, such as the Euclidean and Cosine ones. This paper proposes an efficient approach for cross-cameras data-association focused on a global solution, instead of processing cameras by pairs. To avoid the usage of fixed distances, we leverage the connectivity of Graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Multimodal Machine Learning Applications
