Diverse Semantics-Guided Feature Alignment and Decoupling for Visible-Infrared Person Re-Identification
Neng Dong, Shuanglin Yan, Liyan Zhang, Jinhui Tang

TL;DR
This paper introduces a novel method for visible-infrared person re-identification that aligns and decouples features using diverse semantics and style filtering, improving cross-modality matching accuracy.
Contribution
The paper proposes a new DSFAD network with modules for diverse semantics-guided alignment, style-aware feature decoupling, and semantic consistency-based restitution, advancing VI-ReID performance.
Findings
Outperforms existing methods on three VI-ReID datasets.
Effectively disentangles identity-related and style-related features.
Enhances cross-modality feature alignment and discriminability.
Abstract
Visible-Infrared Person Re-Identification (VI-ReID) is a challenging task due to the large modality discrepancy between visible and infrared images, which complicates the alignment of their features into a suitable common space. Moreover, style noise, such as illumination and color contrast, reduces the identity discriminability and modality invariance of features. To address these challenges, we propose a novel Diverse Semantics-guided Feature Alignment and Decoupling (DSFAD) network to align identity-relevant features from different modalities into a textual embedding space and disentangle identity-irrelevant features within each modality. Specifically, we develop a Diverse Semantics-guided Feature Alignment (DSFA) module, which generates pedestrian descriptions with diverse sentence structures to guide the cross-modality alignment of visual features. Furthermore, to filter out style…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsALIGN
