Arc2Face: A Foundation Model for ID-Consistent Human Faces
Foivos Paraperas Papantoniou, Alexandros Lattas, Stylianos Moschoglou, Jiankang Deng, Bernhard Kainz, Stefanos Zafeiriou

TL;DR
Arc2Face is a novel foundation model that generates highly ID-consistent, photo-realistic human face images from ArcFace embeddings, leveraging an upsampled large-scale dataset and adapting a pretrained diffusion model.
Contribution
The paper introduces Arc2Face, a face generation model conditioned solely on identity features, improving face similarity and ID consistency over existing methods.
Findings
Achieves higher face similarity than previous models.
Uses an upsampled portion of the WebFace42M dataset, since common high-resolution datasets such as FFHQ contain too few identities for reliable identity reconstruction.
Training on synthetic images improves face recognition performance.
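The "face similarity" in the findings above is commonly measured as the cosine similarity between identity embeddings of two face images. The sketch below illustrates that metric only; it is not taken from the paper's code. The function names (l2_normalize, id_similarity) and the toy 4-dimensional vectors are assumptions made for illustration. Real ArcFace embeddings are much higher-dimensional and come from a face recognition network.

```python
import math
from typing import List, Sequence


def l2_normalize(v: Sequence[float]) -> List[float]:
    """Scale a vector to unit length (hypothetical helper for this sketch)."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def id_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two identity embeddings, in [-1, 1]."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    ua, ub = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(ua, ub))


# Toy vectors standing in for real embeddings (assumed values, not real data).
reference = [0.2, -0.1, 0.7, 0.4]
same_direction = [0.4, -0.2, 1.4, 0.8]   # same direction, different magnitude
orthogonal = [0.1, 0.2, 0.0, 0.0]        # dot product with reference is zero

print(id_similarity(reference, same_direction))  # close to 1.0
print(id_similarity(reference, orthogonal))      # close to 0.0
```

A higher score means the two embeddings point in more similar directions. An ID-consistent generator would be expected to produce images whose embeddings score high against the embedding it was conditioned on.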
Abstract
This paper presents Arc2Face, an identity-conditioned face foundation model, which, given the ArcFace embedding of a person, can generate diverse photo-realistic images with a higher degree of face similarity than existing models. Despite previous attempts to decode face recognition features into detailed images, we find that common high-resolution datasets (e.g. FFHQ) lack sufficient identities to reconstruct any subject. To that end, we meticulously upsample a significant portion of the WebFace42M database, the largest public dataset for face recognition (FR). Arc2Face builds upon a pretrained Stable Diffusion model, yet adapts it to the task of ID-to-face generation, conditioned solely on ID vectors. Deviating from recent works that combine ID with text embeddings for zero-shot personalization of text-to-image models, we emphasize the compactness of FR features, which can…
Taxonomy
Topics: Face recognition and analysis · 3D Shape Modeling and Analysis · Face and Expression Recognition
Methods: Dense Connections · Max Pooling · Adaptive Instance Normalization · R1 Regularization · Concatenated Skip Connection · Convolution · StyleGAN · Feedforward Network
