ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
Can Cui, Siteng Huang, Wenxuan Song, Pengxiang Ding, Min Zhang,, Donglin Wang

TL;DR
ProFD introduces a prompt-guided feature disentangling approach leveraging pre-trained textual knowledge to improve occluded person re-identification, achieving state-of-the-art results by aligning visual and textual features and reducing noise.
Contribution
The paper proposes a novel method that uses textual prompts and a hybrid-attention decoder to enhance feature alignment and robustness in occluded person ReID tasks.
Findings
ProFD outperforms existing methods on multiple datasets.
It effectively aligns visual and textual features despite occlusions.
The approach reduces noise impact and mitigates over-fitting.
Abstract
To address the occlusion issues in person Re-Identification (ReID) tasks, many methods have been proposed to extract part features by introducing external spatial information. However, due to missing part appearance information caused by occlusion and noisy spatial information from external model, these purely vision-based approaches fail to correctly learn the features of human body parts from limited training data and struggle in accurately locating body parts, ultimately leading to misaligned part features. To tackle these challenges, we propose a Prompt-guided Feature Disentangling method (ProFD), which leverages the rich pre-trained knowledge in the textual modality facilitate model to generate well-aligned part features. ProFD first designs part-specific prompts and utilizes noisy segmentation mask to preliminarily align visual and textual embedding, enabling the textual prompts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Pose and Action Recognition
MethodsContrastive Language-Image Pre-training · ALIGN
