Where-and-When to Look: Deep Siamese Attention Networks for Video-based   Person Re-identification

Lin Wu; Yang Wang; Junbin Gao; Xue Li

arXiv:1808.01911·cs.CV·October 17, 2018·5 cites

Where-and-When to Look: Deep Siamese Attention Networks for Video-based Person Re-identification

Lin Wu, Yang Wang, Junbin Gao, Xue Li

PDF

Open Access

TL;DR

This paper introduces a deep Siamese attention network that jointly learns spatiotemporal features and similarity metrics for video-based person re-identification, effectively focusing on relevant regions across frames to improve matching accuracy.

Contribution

It proposes a novel Siamese attention architecture that integrates spatial and temporal attention mechanisms within a unified model for person re-id.

Findings

01

Outperforms state-of-the-art methods on benchmark datasets

02

Effectively captures discriminative local features in videos

03

Jointly learns feature representations and similarity metrics

Abstract

Video-based person re-identification (re-id) is a central application in surveillance systems with significant concern in security. Matching persons across disjoint camera views in their video fragments is inherently challenging due to the large visual variations and uncontrolled frame rates. There are two steps crucial to person re-id, namely discriminative feature learning and metric learning. However, existing approaches consider the two steps independently, and they do not make full use of the temporal and spatial information in videos. In this paper, we propose a Siamese attention architecture that jointly learns spatiotemporal video representations and their similarity metrics. The network extracts local convolutional features from regions of each frame, and enhance their discriminative capability by focusing on distinct regions when measuring the similarity with another…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Gait Recognition and Analysis