CLIP-based Camera-Agnostic Feature Learning for Intra-camera Person Re-Identification
Xuan Tan, Xun Gong, Yang Xiang

TL;DR
This paper introduces CCAFL, a novel CLIP-based framework for intra-camera person re-identification that learns camera-agnostic features through intra-camera discriminative and inter-camera adversarial learning, significantly improving accuracy.
Contribution
The paper proposes a new CLIP-based framework with custom modules for intra-camera discriminative and inter-camera adversarial learning, addressing intra-camera ReID challenges.
Findings
Achieves 58.9% mAP on MSMT17, surpassing state-of-the-art by 7.6%.
Effectively learns camera-agnostic pedestrian features.
Demonstrates superior performance on popular ReID datasets.
Abstract
Contrastive Language-Image Pre-Training (CLIP) model excels in traditional person re-identification (ReID) tasks due to its inherent advantage in generating textual descriptions for pedestrian images. However, applying CLIP directly to intra-camera supervised person re-identification (ICS ReID) presents challenges. ICS ReID requires independent identity labeling within each camera, without associations across cameras. This limits the effectiveness of text-based enhancements. To address this, we propose a novel framework called CLIP-based Camera-Agnostic Feature Learning (CCAFL) for ICS ReID. Accordingly, two custom modules are designed to guide the model to actively learn camera-agnostic pedestrian features: Intra-Camera Discriminative Learning (ICDL) and Inter-Camera Adversarial Learning (ICAL). Specifically, we first establish learnable textual prompts for intra-camera pedestrian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Face recognition and analysis · Gait Recognition and Analysis
MethodsContrastive Language-Image Pre-training
