Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real
Yan Yang, George Bebis, Mircea Nicolescu

TL;DR
This paper introduces a two-step generative data augmentation method combining rule-based mask warping and GAN-based image translation to improve masked face detection and recognition, especially with limited data.
Contribution
It presents a novel two-step augmentation framework that surpasses rule-based methods alone and complements existing GAN approaches, with new loss functions and diversity techniques.
Findings
Improved masked face sample quality over rule-based warping
Enhanced detection and recognition performance with augmented data
Effective augmentation with limited target domain images
Abstract
The absence of large-scale masked face datasets challenges masked face detection and recognition. We propose a two-step generative data augmentation framework combining rule-based mask warping with unpaired image-to-image translation via GANs, producing masked face samples that go beyond rule-based overlays. Trained on about 19,100 images in the target domain (3.8% of IAMGAN's scale), or, including out-of-domain transfer pretraining, 59,600 and 11.8%, the proposed approach yields consistent improvements over rule-based warping alone and achieves results complementary to IAMGAN's, showing that both steps contribute. Evaluation is conducted directly on the generated samples and is qualitative; quantitative metrics like FID and KID were not applied as any real reference distribution would unfairly favor the model with closer training data. We introduce a non-mask preservation loss to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face and Expression Recognition
