Deep Recurrent Convolutional Networks for Video-based Person   Re-identification: An End-to-End Approach

Lin Wu; Chunhua Shen; Anton van den Hengel

arXiv:1606.01609·cs.CV·June 14, 2016·56 cites

Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach

Lin Wu, Chunhua Shen, Anton van den Hengel

PDF

Open Access

TL;DR

This paper introduces an end-to-end deep learning framework combining convolutional, recurrent, and pooling layers to improve video-based person re-identification, achieving state-of-the-art results on public datasets.

Contribution

It presents a novel integrated deep network architecture that jointly learns spatio-temporal features and similarity metrics for person re-identification in videos.

Findings

01

Achieves state-of-the-art performance on iLIDS-VID dataset.

02

Outperforms previous methods on PRID 2011 dataset.

03

Effectively models fine motion patterns and temporal variations.

Abstract

In this paper, we present an end-to-end approach to simultaneously learn spatio-temporal features and corresponding similarity metric for video-based person re-identification. Given the video sequence of a person, features from each frame that are extracted from all levels of a deep convolutional network can preserve a higher spatial resolution from which we can model finer motion patterns. These low-level visual percepts are leveraged into a variant of recurrent model to characterize the temporal variation between time-steps. Features from all time-steps are then summarized using temporal pooling to produce an overall feature representation for the complete sequence. The deep convolutional network, recurrent layer, and the temporal pooling are jointly trained to extract comparable hidden-unit representations from input pair of time series to compute their corresponding similarity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Gait Recognition and Analysis