TL;DR
This paper introduces ICPL, a novel identity-conditional prompt learning framework leveraging CLIP for multi-spectral object re-identification, effectively addressing spectral modality differences and improving performance across multiple benchmarks.
Contribution
The paper proposes a new ICPL framework that uses online prompt learning, identity prototypes, and multi-spectral adapters to enhance spectral feature alignment and re-identification accuracy.
Findings
Outperforms state-of-the-art methods on 5 benchmarks.
Effectively handles spectral modality differences.
Improves spectral feature alignment and re-identification accuracy.
Abstract
Multi-spectral object re-identification (ReID) brings a new perception perspective for smart city and intelligent transportation applications, effectively addressing challenges from complex illumination and adverse weather. However, complex modal differences between heterogeneous spectra pose challenges to efficiently utilizing complementary and discrepancy of spectra information. Most existing methods fuse spectral data through intricate modal interaction modules, lacking fine-grained semantic understanding of spectral information (\textit{e.g.}, text descriptions, part masks, and object keypoints). To solve this challenge, we propose a novel Identity-Conditional text Prompt Learning framework (ICPL), which exploits the powerful cross-modal alignment capability of CLIP, to unify different spectral visual features from text semantics. Specifically, we first propose the online prompt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Language-Image Pre-training · Adapter
