Short Range Correlation Transformer for Occluded Person Re-Identification
Yunbin Zhao, Songhao Zhu, Dongsheng Wang, Zhiwei Liang

TL;DR
This paper introduces PFT, a novel transformer-based framework for occluded person re-identification that enhances local feature extraction and short-range correlation, outperforming existing methods.
Contribution
The paper proposes a partial feature transformer (PFT) with three modules to improve local feature extraction and short-range correlation in occluded person re-identification.
Findings
PFT achieves superior performance on occluded and holistic datasets.
The spatial slicing module improves short-range correlation.
Experimental results outperform state-of-the-art methods.
Abstract
Occluded person re-identification is one of the challenging areas of computer vision, which faces problems such as inefficient feature representation and low recognition accuracy. Convolutional neural network pays more attention to the extraction of local features, therefore it is difficult to extract features of occluded pedestrians and the effect is not so satisfied. Recently, vision transformer is introduced into the field of re-identification and achieves the most advanced results by constructing the relationship of global features between patch sequences. However, the performance of vision transformer in extracting local features is inferior to that of convolutional neural network. Therefore, we design a partial feature transformer-based person re-identification framework named PFT. The proposed PFT utilizes three modules to enhance the efficiency of vision transformer. (1) Patch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Image Enhancement Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Softmax · Dense Connections · Layer Normalization · Vision Transformer
