Mutual Information Guided Optimal Transport for Unsupervised Visible-Infrared Person Re-identification
Zhizhong Zhang, Jiangming Wang, Xin Tan, Yanyun Qu, Junping Wang, Yong, Xie, Yuan Xie

TL;DR
This paper introduces a mutual information guided optimal transport method for unsupervised visible-infrared person re-identification, effectively addressing cross-modality variance without annotations through an iterative training and matching strategy.
Contribution
It proposes a novel unsupervised learning framework based on mutual information and optimal transport for cross-modality person re-identification, with a loop training strategy and prototype-based contrastive learning.
Findings
Achieves 60.6% Rank-1 accuracy on SYSU-MM01
Achieves 90.3% Rank-1 accuracy on RegDB
Demonstrates effectiveness without using annotations
Abstract
Unsupervised visible infrared person re-identification (USVI-ReID) is a challenging retrieval task that aims to retrieve cross-modality pedestrian images without using any label information. In this task, the large cross-modality variance makes it difficult to generate reliable cross-modality labels, and the lack of annotations also provides additional difficulties for learning modality-invariant features. In this paper, we first deduce an optimization objective for unsupervised VI-ReID based on the mutual information between the model's cross-modality input and output. With equivalent derivation, three learning principles, i.e., "Sharpness" (entropy minimization), "Fairness" (uniform label distribution), and "Fitness" (reliable cross-modality matching) are obtained. Under their guidance, we design a loop iterative training strategy alternating between model training and cross-modality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Infrared Target Detection Methodologies · Image Enhancement Techniques
MethodsContrastive Learning
