Diverse Semantics-Guided Feature Alignment and Decoupling for   Visible-Infrared Person Re-Identification

Neng Dong; Shuanglin Yan; Liyan Zhang; Jinhui Tang

arXiv:2505.00619·cs.CV·May 2, 2025

Diverse Semantics-Guided Feature Alignment and Decoupling for Visible-Infrared Person Re-Identification

Neng Dong, Shuanglin Yan, Liyan Zhang, Jinhui Tang

PDF

TL;DR

This paper introduces a novel method for visible-infrared person re-identification that aligns and decouples features using diverse semantics and style filtering, improving cross-modality matching accuracy.

Contribution

The paper proposes a new DSFAD network with modules for diverse semantics-guided alignment, style-aware feature decoupling, and semantic consistency-based restitution, advancing VI-ReID performance.

Findings

01

Outperforms existing methods on three VI-ReID datasets.

02

Effectively disentangles identity-related and style-related features.

03

Enhances cross-modality feature alignment and discriminability.

Abstract

Visible-Infrared Person Re-Identification (VI-ReID) is a challenging task due to the large modality discrepancy between visible and infrared images, which complicates the alignment of their features into a suitable common space. Moreover, style noise, such as illumination and color contrast, reduces the identity discriminability and modality invariance of features. To address these challenges, we propose a novel Diverse Semantics-guided Feature Alignment and Decoupling (DSFAD) network to align identity-relevant features from different modalities into a textual embedding space and disentangle identity-irrelevant features within each modality. Specifically, we develop a Diverse Semantics-guided Feature Alignment (DSFA) module, which generates pedestrian descriptions with diverse sentence structures to guide the cross-modality alignment of visual features. Furthermore, to filter out style…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsALIGN