Temporal Prototyping and Hierarchical Alignment for Unsupervised Video-based Visible-Infrared Person Re-Identification
Zhiyong Li, Wei Jiang, Haojie Liu, Mingyu Wang, Wanchong Xu, Weijie Mao

TL;DR
This paper introduces HiTPro, an unsupervised video-based visible-infrared person re-identification framework that leverages hierarchical prototypes and contrastive learning to improve cross-modality matching without identity labels.
Contribution
The paper proposes a novel hierarchical prototype-driven framework for unsupervised video VI-ReID, eliminating the need for explicit pseudo-labels and enhancing cross-modality feature alignment.
Findings
Achieves state-of-the-art performance on HITSZ-VCM and BUPTCampus datasets.
Outperforms existing unsupervised methods significantly.
Demonstrates effective hierarchical prototype and contrastive learning strategies.
Abstract
Visible-infrared person re-identification (VI-ReID) enables cross-modality identity matching for all-day surveillance, yet existing methods predominantly focus on the image level or rely heavily on costly identity annotations. While video-based VI-ReID has recently emerged to exploit temporal dynamics for improved robustness, existing studies remain limited to supervised settings. Crucially, the unsupervised video VI-ReID problem, where models must learn from RGB and infrared tracklets without identity labels, remains largely unexplored despite its practical importance in real-world deployment. To bridge this gap, we propose HiTPro (Hierarchical Temporal Prototyping), a prototype-driven framework without explicit hard pseudo-label assignment for unsupervised video-based VI-ReID. HiTPro begins with an efficient Temporal-aware Feature Encoder that first extracts discriminative frame-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
