CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification
Chaoyou Fu, Yibo Hu, Xiang Wu, Hailin Shi, Tao Mei, Ran He

TL;DR
This paper introduces CM-NAS, a neural architecture search method focused on optimizing Batch Normalization layer separation to improve cross-modality person re-identification between visible and infrared images, outperforming existing methods.
Contribution
The paper proposes a novel BN-oriented neural architecture search method specifically designed for cross-modality person re-identification tasks.
Findings
Outperforms state-of-the-art on SYSU-MM01 and RegDB benchmarks.
Achieves 6.70%/6.13% improvement in Rank-1/mAP on SYSU-MM01.
Achieves 12.17%/11.23% improvement in Rank-1/mAP on RegDB.
Abstract
Visible-Infrared person re-identification (VI-ReID) aims to match cross-modality pedestrian images, breaking through the limitation of single-modality person ReID in dark environment. In order to mitigate the impact of large modality discrepancy, existing works manually design various two-stream architectures to separately learn modality-specific and modality-sharable representations. Such a manual design routine, however, highly depends on massive experiments and empirical practice, which is time consuming and labor intensive. In this paper, we systematically study the manually designed architectures, and identify that appropriately separating Batch Normalization (BN) layers is the key to bring a great boost towards cross-modality matching. Based on this observation, the essential objective is to find the optimal separation scheme for each BN layer. To this end, we propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Human Pose and Action Recognition
MethodsBatch Normalization
