Diversity Regularized Spatiotemporal Attention for Video-based Person Re-identification
Shuang Li, Slawomir Bak, Peter Carr, Xiaogang Wang

TL;DR
This paper introduces a novel diversity regularized spatiotemporal attention model for video-based person re-identification, effectively handling occlusions by focusing on multiple distinctive body parts across frames.
Contribution
It proposes a new attention mechanism with diversity regularization that automatically discovers and leverages multiple body parts for improved re-identification accuracy.
Findings
Outperforms state-of-the-art methods on three datasets
Effectively handles occlusions and misalignments
Learns latent representations of body parts from video sequences
Abstract
Video-based person re-identification matches video clips of people across non-overlapping cameras. Most existing methods tackle this problem by encoding each video frame in its entirety and computing an aggregate representation across all frames. In practice, people are often partially occluded, which can corrupt the extracted features. Instead, we propose a new spatiotemporal attention model that automatically discovers a diverse set of distinctive body parts. This allows useful information to be extracted from all frames without succumbing to occlusions and misalignments. The network learns multiple spatial attention models and employs a diversity regularization term to ensure multiple models do not discover the same body part. Features extracted from local image regions are organized by spatial attention model and are combined using temporal attention. As a result, the network learns…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Face recognition and analysis
