CLIP-based Camera-Agnostic Feature Learning for Intra-camera Person   Re-Identification

Xuan Tan; Xun Gong; Yang Xiang

arXiv:2409.19563·cs.CV·October 1, 2024

CLIP-based Camera-Agnostic Feature Learning for Intra-camera Person Re-Identification

Xuan Tan, Xun Gong, Yang Xiang

PDF

Open Access 1 Repo

TL;DR

This paper introduces CCAFL, a novel CLIP-based framework for intra-camera person re-identification that learns camera-agnostic features through intra-camera discriminative and inter-camera adversarial learning, significantly improving accuracy.

Contribution

The paper proposes a new CLIP-based framework with custom modules for intra-camera discriminative and inter-camera adversarial learning, addressing intra-camera ReID challenges.

Findings

01

Achieves 58.9% mAP on MSMT17, surpassing state-of-the-art by 7.6%.

02

Effectively learns camera-agnostic pedestrian features.

03

Demonstrates superior performance on popular ReID datasets.

Abstract

Contrastive Language-Image Pre-Training (CLIP) model excels in traditional person re-identification (ReID) tasks due to its inherent advantage in generating textual descriptions for pedestrian images. However, applying CLIP directly to intra-camera supervised person re-identification (ICS ReID) presents challenges. ICS ReID requires independent identity labeling within each camera, without associations across cameras. This limits the effectiveness of text-based enhancements. To address this, we propose a novel framework called CLIP-based Camera-Agnostic Feature Learning (CCAFL) for ICS ReID. Accordingly, two custom modules are designed to guide the model to actively learn camera-agnostic pedestrian features: Intra-Camera Discriminative Learning (ICDL) and Inter-Camera Adversarial Learning (ICAL). Specifically, we first establish learnable textual prompts for intra-camera pedestrian…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Trangle12/CCAFL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Face recognition and analysis · Gait Recognition and Analysis

MethodsContrastive Language-Image Pre-training