GLCN: Graph-Aware Locality-Enhanced Cross-Modality Re-ID Network
Junjie Cao, Yuhang Yu, Rong Rong, Xing Xie

TL;DR
This paper introduces GLCN, a new network for cross-modality person re-identification that improves accuracy by enhancing local features and aligning structures across different modalities.
Contribution
GLCN introduces modules for locality preservation, cross-branch consistency, and class-center geometry alignment, together with a new loss function that encourages compact identity structures within each modality.
Findings
GLCN outperforms existing methods on benchmarks like SYSU-MM01 and RegDB.
The proposed modules and loss function effectively reduce intra-class variance and improve inter-class separation.
The framework enhances cross-modality alignment and robustness to illumination and occlusion issues.
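To make the second finding concrete, the sketch below shows one common way to quantify intra-class variance and inter-class separation on a set of embeddings. It is an illustrative measurement only, not the evaluation protocol of the paper; the function names `intra_class_variance` and `inter_class_separation` are hypothetical, and the use of Euclidean distance between class centers is an assumption.

```python
import numpy as np

def intra_class_variance(features, labels):
    """Average squared distance of each sample to its own class center."""
    total = 0.0
    for c in np.unique(labels):
        members = features[labels == c]
        total += np.sum((members - members.mean(axis=0)) ** 2)
    return total / len(features)

def inter_class_separation(features, labels):
    """Mean pairwise Euclidean distance between class centers (needs >= 2 classes)."""
    centers = np.stack([features[labels == c].mean(axis=0) for c in np.unique(labels)])
    diffs = centers[:, None, :] - centers[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=-1))
    k = len(centers)
    # The diagonal is zero, so dividing by k * (k - 1) averages over distinct pairs.
    return dists.sum() / (k * (k - 1))

# Toy embeddings: two identities, two samples each (made-up values).
features = np.array([[0.0, 0.0], [0.0, 2.0], [10.0, 0.0], [10.0, 2.0]])
labels = np.array([0, 0, 1, 1])
print(intra_class_variance(features, labels))    # 1.0
print(inter_class_separation(features, labels))  # 10.0
```

A lower first value together with a higher second value corresponds to the tighter, better separated identity clusters described above.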
Abstract
Cross-modality person re-identification faces challenges such as illumination discrepancies, local occlusions, and inconsistent modality structures, leading to misalignment and sensitivity issues. We propose GLCN, a framework that addresses these problems by enhancing representation learning through locality enhancement, cross-modality structural alignment, and intra-modality compactness. Key components include the Locality-Preserved Cross-branch Fusion (LPCF) module, which combines Local–Positional–Channel Gating (LPCG) for local region and positional sensitivity; Cross-branch Context Interpolated Attention (CCIA) for stable cross-branch consistency; and Graph-Enhanced Center Geometry Alignment (GE-CGA), which aligns class-center similarity structures across modalities to preserve category-level relationships. We also introduce Intra-Modal Prototype Discrepancy Mining Loss (IPDM-Loss)…
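The idea of aligning class-center similarity structures across modalities can be sketched as follows. This is a minimal illustration under stated assumptions, not the GE-CGA module itself: the abstract does not specify the graph enhancement, the similarity measure, or the penalty, so the sketch assumes cosine similarity between per-identity mean features and a mean squared difference between the two similarity matrices. All function names are hypothetical.

```python
import numpy as np

def class_centers(features, labels, num_classes):
    """Mean feature vector per identity; assumes every class id appears at least once."""
    centers = np.zeros((num_classes, features.shape[1]))
    for c in range(num_classes):
        centers[c] = features[labels == c].mean(axis=0)
    return centers

def cosine_similarity_matrix(centers, eps=1e-12):
    """Pairwise cosine similarity between class centers."""
    normed = centers / (np.linalg.norm(centers, axis=1, keepdims=True) + eps)
    return normed @ normed.T

def center_geometry_gap(feat_a, labels_a, feat_b, labels_b, num_classes):
    """Mean squared difference between the center-similarity matrices of two modalities.

    Assumed penalty for illustration; zero when both modalities arrange
    their identity centers with the same pairwise similarities.
    """
    sim_a = cosine_similarity_matrix(class_centers(feat_a, labels_a, num_classes))
    sim_b = cosine_similarity_matrix(class_centers(feat_b, labels_b, num_classes))
    return float(np.mean((sim_a - sim_b) ** 2))
```

Because the penalty compares similarity matrices rather than the centers themselves, it does not require the two modalities to share a coordinate system: features that differ only by a rotation yield a gap of approximately zero, which matches the stated goal of preserving category-level relationships rather than forcing identical embeddings.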
Taxonomy
Topics: Video Surveillance and Tracking Methods · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
