Multi-Level Attention for Unsupervised Person Re-Identification

Yi Zheng

arXiv:2201.03141·cs.CV·January 11, 2022

Multi-Level Attention for Unsupervised Person Re-Identification

Yi Zheng

PDF

Open Access

TL;DR

This paper introduces a multi-level attention framework for unsupervised person re-identification, combining pixel, head, and domain-level attention to improve feature extraction and identification accuracy.

Contribution

It proposes a novel multi-level attention block that addresses attention spreading and enhances pedestrian feature representation in unsupervised re-identification.

Findings

01

Improved re-identification accuracy on Market-1501, DukeMTMC-reID, MSMT17, and PersonX datasets.

02

Effective suppression of attention spreading in unsupervised settings.

03

Enhanced feature representation through combined multi-level attention.

Abstract

The attention mechanism is widely used in deep learning because of its excellent performance in neural networks without introducing additional information. However, in unsupervised person re-identification, the attention module represented by multi-headed self-attention suffers from attention spreading in the condition of non-ground truth. To solve this problem, we design pixel-level attention module to provide constraints for multi-headed self-attention. Meanwhile, for the trait that the identification targets of person re-identification data are all pedestrians in the samples, we design domain-level attention module to provide more comprehensive pedestrian features. We combine head-level, pixel-level and domain-level attention to propose multi-level attention block and validate its performance on for large person re-identification datasets (Market-1501, DukeMTMC-reID and MSMT17 and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Automated Road and Building Extraction · Gait Recognition and Analysis