TL;DR
This paper introduces a novel graph neural network framework with multi-level attention for group re-identification in video surveillance, effectively leveraging group context and dependencies.
Contribution
It proposes a unified graph-based model with multi-level attention for both group re-id and group-aware person re-id, along with a new large-scale dataset.
Findings
The framework outperforms existing methods on multiple datasets.
The multi-level attention mechanism improves context modeling.
The new dataset facilitates future research in group re-id.
Abstract
Learning to re-identify or retrieve a group of people across non-overlapped camera systems has important applications in video surveillance. However, most existing methods focus on (single) person re-identification (re-id), ignoring the fact that people often walk in groups in real scenarios. In this work, we take a step further and consider employing context information for identifying groups of people, i.e., group re-id. We propose a novel unified framework based on graph neural networks to simultaneously address the group-based re-id tasks, i.e., group re-id and group-aware person re-id. Specifically, we construct a context graph with group members as its nodes to exploit dependencies among different people. A multi-level attention mechanism is developed to formulate both intra-group and inter-group context, with an additional self-attention module for robust graph-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
