Prepended Domain Transformer: Heterogeneous Face Recognition without Bells and Whistles
Anjith George, Amir Mohammadi, Sebastien Marcel

TL;DR
This paper introduces the Prepended Domain Transformer (PDT), a simple yet effective neural network block added to pre-trained face recognition models, enabling high-performance heterogeneous face recognition with minimal paired data and broad applicability.
Contribution
The paper proposes a novel PDT block that can be added to any pre-trained face recognition model, allowing effective cross-domain face matching with minimal retraining and data.
Findings
Achieves state-of-the-art results on multiple HFR benchmarks.
Requires only a few paired samples for retraining the PDT block.
Compatible with various pre-trained face recognition architectures.
Abstract
Heterogeneous Face Recognition (HFR) refers to matching face images captured in different domains, such as thermal to visible images (VIS), sketches to visible images, near-infrared to visible, and so on. This is particularly useful in matching visible spectrum images to images captured from other modalities. Though highly useful, HFR is challenging because of the domain gap between the source and target domain. Often, large-scale paired heterogeneous face image datasets are absent, preventing training models specifically for the heterogeneous task. In this work, we propose a surprisingly simple, yet, very effective method for matching face images across different sensing modalities. The core idea of the proposed approach is to add a novel neural network block called Prepended Domain Transformer (PDT) in front of a pre-trained face recognition (FR) model to address the domain gap.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition · Biometric Identification and Security
MethodsMulti-Head Attention · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Layer Normalization · Contrastive Learning · Position-Wise Feed-Forward Layer · Residual Connection · Dropout · Adam
