CA3Net: Contextual-Attentional Attribute-Appearance Network for Person   Re-Identification

Jiawei Liu; Zheng-Jun Zha; Hongtao Xie; Zhiwei Xiong and; Yongdong Zhang

arXiv:1811.07544·cs.CV·November 20, 2018·20 cites

CA3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-Identification

Jiawei Liu, Zheng-Jun Zha, Hongtao Xie, Zhiwei Xiong and, Yongdong Zhang

PDF

Open Access

TL;DR

This paper introduces CA3Net, a novel deep learning model that combines attribute and appearance features with attention mechanisms and spatial dependencies to improve person re-identification accuracy across camera views.

Contribution

The paper presents a new network architecture that jointly learns attribute and appearance features using attention and LSTM modules for robust person re-identification.

Findings

01

Outperforms existing methods on Market-1501 and DukeMTMC-reID datasets.

02

Effectively exploits semantic attribute context and spatial dependencies.

03

Achieves higher re-identification accuracy with comprehensive pedestrian representations.

Abstract

Person re-identification aims to identify the same pedestrian across non-overlapping camera views. Deep learning techniques have been applied for person re-identification recently, towards learning representation of pedestrian appearance. This paper presents a novel Contextual-Attentional Attribute-Appearance Network (CA3Net) for person re-identification. The CA3Net simultaneously exploits the complementarity between semantic attributes and visual appearance, the semantic context among attributes, visual attention on attributes as well as spatial dependencies among body parts, leading to discriminative and robust pedestrian representation. Specifically, an attribute network within CA3Net is designed with an Attention-LSTM module. It concentrates the network on latent image regions related to each attribute as well as exploits the semantic context among attributes by a LSTM module. An…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Face recognition and analysis · Human Pose and Action Recognition

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory