Face Transformer for Recognition

Yaoyao Zhong; Weihong Deng

arXiv:2103.14803·cs.CV·April 14, 2021·45 cites

Face Transformer for Recognition

Yaoyao Zhong, Weihong Deng

PDF

Open Access 2 Repos

TL;DR

This paper explores the use of Transformer models for face recognition, modifying the patch generation process to improve inter-patch information, and demonstrates comparable performance to CNNs on standard benchmarks.

Contribution

It introduces a novel Face Transformer model with overlapping patches for face recognition and evaluates its performance against CNNs on multiple datasets.

Findings

01

Face Transformer achieves comparable accuracy to CNNs.

02

Modified patch generation improves inter-patch information.

03

Models trained on large datasets perform well on benchmarks.

Abstract

Recently there has been a growing interest in Transformer not only in NLP but also in computer vision. We wonder if transformer can be used in face recognition and whether it is better than CNNs. Therefore, we investigate the performance of Transformer models in face recognition. Considering the original Transformer may neglect the inter-patch information, we modify the patch generation process and make the tokens with sliding patches which overlaps with each others. The models are trained on CASIA-WebFace and MS-Celeb-1M databases, and evaluated on several mainstream benchmarks, including LFW, SLLFW, CALFW, CPLFW, TALFW, CFP-FP, AGEDB and IJB-C databases. We demonstrate that Face Transformer models trained on a large-scale database, MS-Celeb-1M, achieve comparable performance as CNN with similar number of parameters and MACs. To facilitate further researches, Face Transformer models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Face and Expression Recognition · Biometric Identification and Security

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Softmax · Dense Connections · Attention Is All You Need · Dropout · Layer Normalization · Residual Connection