Prepended Domain Transformer: Heterogeneous Face Recognition without   Bells and Whistles

Anjith George; Amir Mohammadi; Sebastien Marcel

arXiv:2210.06529·cs.CV·October 14, 2022·1 cites

Prepended Domain Transformer: Heterogeneous Face Recognition without Bells and Whistles

Anjith George, Amir Mohammadi, Sebastien Marcel

PDF

Open Access 2 Repos

TL;DR

This paper introduces the Prepended Domain Transformer (PDT), a simple yet effective neural network block added to pre-trained face recognition models, enabling high-performance heterogeneous face recognition with minimal paired data and broad applicability.

Contribution

The paper proposes a novel PDT block that can be added to any pre-trained face recognition model, allowing effective cross-domain face matching with minimal retraining and data.

Findings

01

Achieves state-of-the-art results on multiple HFR benchmarks.

02

Requires only a few paired samples for retraining the PDT block.

03

Compatible with various pre-trained face recognition architectures.

Abstract

Heterogeneous Face Recognition (HFR) refers to matching face images captured in different domains, such as thermal to visible images (VIS), sketches to visible images, near-infrared to visible, and so on. This is particularly useful in matching visible spectrum images to images captured from other modalities. Though highly useful, HFR is challenging because of the domain gap between the source and target domain. Often, large-scale paired heterogeneous face image datasets are absent, preventing training models specifically for the heterogeneous task. In this work, we propose a surprisingly simple, yet, very effective method for matching face images across different sensing modalities. The core idea of the proposed approach is to add a novel neural network block called Prepended Domain Transformer (PDT) in front of a pre-trained face recognition (FR) model to address the domain gap.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Face and Expression Recognition · Biometric Identification and Security

MethodsMulti-Head Attention · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Layer Normalization · Contrastive Learning · Position-Wise Feed-Forward Layer · Residual Connection · Dropout · Adam