Pose-guided Feature Disentangling for Occluded Person Re-identification   Based on Transformer

Tao Wang; Hong Liu; Pinhao Song; Tianyu Guo; Wei Shi

arXiv:2112.02466·cs.CV·December 14, 2021

Pose-guided Feature Disentangling for Occluded Person Re-identification Based on Transformer

Tao Wang, Hong Liu, Pinhao Song, Tianyu Guo, Wei Shi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a transformer-based method for occluded person re-identification that effectively disentangles body parts using pose information, improving matching accuracy in occluded scenarios.

Contribution

The proposed Pose-guided Feature Disentangling (PFD) method leverages transformer architecture and explicit pose-view matching to enhance occluded person re-ID performance.

Findings

01

Outperforms state-of-the-art methods on five datasets

02

Effectively disentangles occluded body parts

03

Improves robustness in occlusion scenarios

Abstract

Occluded person re-identification is a challenging task as human body parts could be occluded by some obstacles (e.g. trees, cars, and pedestrians) in certain scenes. Some existing pose-guided methods solve this problem by aligning body parts according to graph matching, but these graph-based methods are not intuitive and complicated. Therefore, we propose a transformer-based Pose-guided Feature Disentangling (PFD) method by utilizing pose information to clearly disentangle semantic components (e.g. human body or joint parts) and selectively match non-occluded parts correspondingly. First, Vision Transformer (ViT) is used to extract the patch features with its strong capability. Second, to preliminarily disentangle the pose information from patch information, the matching and distributing mechanism is leveraged in Pose-guided Feature Aggregation (PFA) module. Third, a set of learnable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

WangTaoAs/PFD_Net
pytorchOfficial

Videos

Pose-Guided Feature Disentangling for Occluded Person Re-Identification Based on Transformer· underline

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Neural Network Applications

MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Softmax · Residual Connection · Adam · Dropout · Position-Wise Feed-Forward Layer · Layer Normalization · Dense Connections