CLIP-Driven Cloth-Agnostic Feature Learning for Cloth-Changing Person   Re-Identification

Shuang Li; Jiaxu Leng; Guozhang Li; Ji Gan; Haosheng chen; Xinbo Gao

arXiv:2406.09198·cs.CV·June 14, 2024·1 cites

CLIP-Driven Cloth-Agnostic Feature Learning for Cloth-Changing Person Re-Identification

Shuang Li, Jiaxu Leng, Guozhang Li, Ji Gan, Haosheng chen, Xinbo Gao

PDF

Open Access

TL;DR

This paper introduces a novel CLIP-based framework that learns cloth-agnostic features for person re-identification under clothing changes, addressing CLIP's focus on clothing clues.

Contribution

The proposed CCAF framework employs IFP and CFM modules to extract and emphasize cloth-agnostic features, improving CC-ReID performance without extra inference time.

Findings

01

Achieves state-of-the-art results on CC-ReID benchmarks.

02

Effectively disentangles clothing features from identity features.

03

No additional inference cost incurred.

Abstract

Contrastive Language-Image Pre-Training (CLIP) has shown impressive performance in short-term Person Re-Identification (ReID) due to its ability to extract high-level semantic features of pedestrians, yet its direct application to Cloth-Changing Person Re-Identification (CC-ReID) faces challenges due to CLIP's image encoder overly focusing on clothes clues. To address this, we propose a novel framework called CLIP-Driven Cloth-Agnostic Feature Learning (CCAF) for CC-ReID. Accordingly, two modules were custom-designed: the Invariant Feature Prompting (IFP) and the Clothes Feature Minimization (CFM). These modules guide the model to extract cloth-agnostic features positively and attenuate clothes-related features negatively. Specifically, IFP is designed to extract fine-grained semantic features unrelated to clothes from the raw image, guided by the cloth-agnostic text prompts. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Video Surveillance and Tracking Methods