Ensemble Learning using Transformers and Convolutional Networks for Masked Face Recognition
Mohammed R. Al-Sinan, Aseel F. Haneef, Hamzah Luqman

TL;DR
This paper introduces an ensemble system combining CNNs and Transformers for masked face recognition, achieving 92% accuracy on a synthetic dataset, addressing challenges posed by face masks in recognition tasks.
Contribution
The study presents a novel ensemble approach using CNNs and Transformers specifically designed for recognizing masked faces, improving accuracy over existing models.
Findings
Ensemble of CNNs and Transformers achieves 92% accuracy.
The proposed system outperforms individual models in masked face recognition.
Synthetic masked dataset used for evaluation demonstrates robustness of the approach.
Abstract
Wearing a face mask is one of the adjustments we had to follow to reduce the spread of the coronavirus. Having our faces covered by masks constantly has driven the need to understand and investigate how this behavior affects the recognition capability of face recognition systems. Current face recognition systems have extremely high accuracy when dealing with unconstrained general face recognition cases but do not generalize well with occluded masked faces. In this work, we propose a system for masked face recognition. The proposed system comprises two Convolutional Neural Network (CNN) models and two Transformer models. The CNN models have been fine-tuned on FaceNet pre-trained model. We ensemble the predictions of the four models using the majority voting technique to identify the person with the mask. The proposed system has been evaluated on a synthetically masked LFW dataset created…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Residual Connection · Dropout · Softmax · Label Smoothing · Adam · Dense Connections
