Pose-guided Feature Disentangling for Occluded Person Re-identification Based on Transformer
Tao Wang, Hong Liu, Pinhao Song, Tianyu Guo, Wei Shi

TL;DR
This paper introduces a transformer-based method for occluded person re-identification that effectively disentangles body parts using pose information, improving matching accuracy in occluded scenarios.
Contribution
The proposed Pose-guided Feature Disentangling (PFD) method leverages transformer architecture and explicit pose-view matching to enhance occluded person re-ID performance.
Findings
Outperforms state-of-the-art methods on five datasets
Effectively disentangles occluded body parts
Improves robustness in occlusion scenarios
Abstract
Occluded person re-identification is a challenging task as human body parts could be occluded by some obstacles (e.g. trees, cars, and pedestrians) in certain scenes. Some existing pose-guided methods solve this problem by aligning body parts according to graph matching, but these graph-based methods are not intuitive and complicated. Therefore, we propose a transformer-based Pose-guided Feature Disentangling (PFD) method by utilizing pose information to clearly disentangle semantic components (e.g. human body or joint parts) and selectively match non-occluded parts correspondingly. First, Vision Transformer (ViT) is used to extract the patch features with its strong capability. Second, to preliminarily disentangle the pose information from patch information, the matching and distributing mechanism is leveraged in Pose-guided Feature Aggregation (PFA) module. Third, a set of learnable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Softmax · Residual Connection · Adam · Dropout · Position-Wise Feed-Forward Layer · Layer Normalization · Dense Connections
