Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person   Re-Identification

Mang Ye; Jianbing Shen; David J. Crandall; Ling Shao; Jiebo Luo

arXiv:2007.09314·cs.CV·July 21, 2020·32 cites

Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification

Mang Ye, Jianbing Shen, David J. Crandall, Ling Shao, Jiebo Luo

PDF

Open Access 5 Repos

TL;DR

This paper introduces a novel dynamic dual-attentive aggregation learning method for visible-infrared person re-identification, effectively capturing intra-modality and cross-modality cues to improve discriminability and robustness against noise.

Contribution

The paper proposes a dynamic dual-attentive aggregation approach that mines intra-modality and cross-modality contextual cues, enhancing feature discriminability and robustness in VI-ReID.

Findings

01

Outperforms state-of-the-art methods in various settings

02

Effective in handling noisy samples and large intra-class variations

03

Improves discriminative part feature extraction

Abstract

Visible-infrared person re-identification (VI-ReID) is a challenging cross-modality pedestrian retrieval problem. Due to the large intra-class variations and cross-modality discrepancy with large amount of sample noise, it is difficult to learn discriminative part features. Existing VI-ReID methods instead tend to learn global representations, which have limited discriminability and weak robustness to noisy images. In this paper, we propose a novel dynamic dual-attentive aggregation (DDAG) learning method by mining both intra-modality part-level and cross-modality graph-level contextual cues for VI-ReID. We propose an intra-modality weighted-part attention module to extract discriminative part-aggregated features, by imposing the domain knowledge on the part relationship mining. To enhance robustness against noisy samples, we introduce cross-modality graph structured attention to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Advanced Neural Network Applications