TL;DR
This paper introduces Mask Contrastive Face (MCF), a self-supervised pre-training framework that significantly enhances facial representation quality for tasks like face alignment and parsing, outperforming existing methods.
Contribution
The paper proposes a novel self-supervised pre-training method using mask image modeling and contrastive learning tailored for face analysis tasks.
Findings
Pre-trained on LAION-FACE-cropped, MCF outperforms state-of-the-art methods.
Achieves 0.932 NME_diag on AFLW-19 for face alignment.
Achieves 93.96 F1 score on LaPa face parsing.
Abstract
Face analysis tasks have a wide range of applications, but the universal facial representation has only been explored in a few works. In this paper, we explore high-performance pre-training methods to boost the face analysis tasks such as face alignment and face parsing. We propose a self-supervised pre-training framework, called \textbf{\it Mask Contrastive Face (MCF)}, with mask image modeling and a contrastive strategy specially adjusted for face domain tasks. To improve the facial representation quality, we use feature map of a pre-trained visual backbone as a supervision item and use a partially pre-trained decoder for mask image modeling. To handle the face identity during the pre-training stage, we further use random masks to build contrastive learning pairs. We conduct the pre-training on the LAION-FACE-cropped dataset, a variants of LAION-FACE 20M, which contains more than 20…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
