A Spatial and Temporal Features Mixture Model with Body Parts for   Video-based Person Re-Identification

Jie Liu; Cheng Sun; Xiang Xu; Baomin Xu; Shuangyuan Yu

arXiv:1807.00975·cs.CV·July 4, 2018

A Spatial and Temporal Features Mixture Model with Body Parts for Video-based Person Re-Identification

Jie Liu, Cheng Sun, Xiang Xu, Baomin Xu, Shuangyuan Yu

PDF

Open Access

TL;DR

This paper introduces a novel CNN-RNN based model that splits human body parts in video sequences to extract and integrate spatial and temporal features, significantly improving person re-identification accuracy across datasets.

Contribution

The paper proposes a new spatial-temporal feature mixture model that splits human body into parts for more detailed feature extraction, enhancing re-identification performance.

Findings

01

Achieves 74% rank-1 accuracy on iLIDS-VID, surpassing previous methods by 12%.

02

Attains 48% rank-1 accuracy in cross-data testing, outperforming prior approaches by 18%.

03

Demonstrates the model's stability and effectiveness across different datasets.

Abstract

The video-based person re-identification is to recognize a person under different cameras, which is a crucial task applied in visual surveillance system. Most previous methods mainly focused on the feature of full body in the frame. In this paper we propose a novel Spatial and Temporal Features Mixture Model (STFMM) based on convolutional neural network (CNN) and recurrent neural network (RNN), in which the human body is split into $N$ parts in horizontal direction so that we can obtain more specific features. The proposed method skillfully integrates features of each part to achieve more expressive representation of each person. We first split the video sequence into $N$ part sequences which include the information of head, waist, legs and so on. Then the features are extracted by STFMM whose $2 N$ inputs are obtained from the developed Siamese network, and these features are combined…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Gait Recognition and Analysis