Learning Deep Context-aware Features over Body and Latent Parts for   Person Re-identification

Dangwei Li; Xiaotang Chen; Zhang Zhang; Kaiqi Huang

arXiv:1710.06555·cs.CV·October 19, 2017·90 cites

Learning Deep Context-aware Features over Body and Latent Parts for Person Re-identification

Dangwei Li, Xiaotang Chen, Zhang Zhang, Kaiqi Huang

PDF

Open Access

TL;DR

This paper introduces a multi-scale, context-aware deep network that learns both full-body and deformable body parts for person re-identification, significantly improving accuracy on challenging datasets.

Contribution

It proposes a novel multi-scale network with deformable part localization using Spatial Transformer Networks, enhancing feature robustness for person ReID.

Findings

01

Achieves state-of-the-art results on Market1501, CUHK03, and MARS datasets.

02

Effectively handles pose variations and background clutter.

03

Outperforms existing methods in person re-identification accuracy.

Abstract

Person Re-identification (ReID) is to identify the same person across different cameras. It is a challenging task due to the large variations in person pose, occlusion, background clutter, etc How to extract powerful features is a fundamental problem in ReID and is still an open problem today. In this paper, we design a Multi-Scale Context-Aware Network (MSCAN) to learn powerful features over full body and body parts, which can well capture the local context knowledge by stacking multi-scale convolutions in each layer. Moreover, instead of using predefined rigid parts, we propose to learn and localize deformable pedestrian parts using Spatial Transformer Networks (STN) with novel spatial constraints. The learned body parts can release some difficulties, eg pose variations and background clutters, in part-based representation. Finally, we integrate the representation learning processes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Pose and Action Recognition

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Spatial Transformer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam