ProFD: Prompt-Guided Feature Disentangling for Occluded Person   Re-Identification

Can Cui; Siteng Huang; Wenxuan Song; Pengxiang Ding; Min Zhang,; Donglin Wang

arXiv:2409.20081·cs.CV·October 1, 2024

ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification

Can Cui, Siteng Huang, Wenxuan Song, Pengxiang Ding, Min Zhang,, Donglin Wang

PDF

Open Access 1 Repo

TL;DR

ProFD introduces a prompt-guided feature disentangling approach leveraging pre-trained textual knowledge to improve occluded person re-identification, achieving state-of-the-art results by aligning visual and textual features and reducing noise.

Contribution

The paper proposes a novel method that uses textual prompts and a hybrid-attention decoder to enhance feature alignment and robustness in occluded person ReID tasks.

Findings

01

ProFD outperforms existing methods on multiple datasets.

02

It effectively aligns visual and textual features despite occlusions.

03

The approach reduces noise impact and mitigates over-fitting.

Abstract

To address the occlusion issues in person Re-Identification (ReID) tasks, many methods have been proposed to extract part features by introducing external spatial information. However, due to missing part appearance information caused by occlusion and noisy spatial information from external model, these purely vision-based approaches fail to correctly learn the features of human body parts from limited training data and struggle in accurately locating body parts, ultimately leading to misaligned part features. To tackle these challenges, we propose a Prompt-guided Feature Disentangling method (ProFD), which leverages the rich pre-trained knowledge in the textual modality facilitate model to generate well-aligned part features. ProFD first designs part-specific prompts and utilizes noisy segmentation mask to preliminarily align visual and textual embedding, enabling the textual prompts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cuixxx/profd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Pose and Action Recognition

MethodsContrastive Language-Image Pre-training · ALIGN