Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition
Ran He, Xiang Wu, Zhenan Sun, Tieniu Tan

TL;DR
This paper introduces Wasserstein CNN, a novel deep learning framework that learns modality-invariant features for NIR-VIS face recognition by minimizing Wasserstein distance between feature distributions, improving recognition accuracy across modalities.
Contribution
The paper proposes Wasserstein CNN with a shared layer using Wasserstein distance and a correlation prior to effectively learn invariant features for heterogeneous face recognition.
Findings
Outperforms state-of-the-art methods on NIR-VIS databases.
Effectively minimizes distribution dissimilarity between modalities.
Reduces overfitting with low-rank correlation prior.
Abstract
Heterogeneous face recognition (HFR) aims to match facial images acquired from different sensing modalities with mission-critical applications in forensics, security and commercial sectors. However, HFR is a much more challenging problem than traditional face recognition because of large intra-class variations of heterogeneous face images and limited training samples of cross-modality face image pairs. This paper proposes a novel approach namely Wasserstein CNN (convolutional neural networks, or WCNN for short) to learn invariant features between near-infrared and visual face images (i.e. NIR-VIS face recognition). The low-level layers of WCNN are trained with widely available face images in visual spectrum. The high-level layer is divided into three parts, i.e., NIR layer, VIS layer and NIR-VIS shared layer. The first two layers aims to learn modality-specific features and NIR-VIS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Biometric Identification and Security · Face and Expression Recognition
