InstaFace: Identity-Preserving Facial Editing with Single Image Inference
MD Wahiduzzaman Khan, Mingshan Jia, Xiaolin Zhang, En Yu, Caifeng, Shan, Kaska Musial-Gabrys

TL;DR
InstaFace is a diffusion-based framework that enables realistic facial editing from a single image while effectively preserving identity and contextual features, addressing limitations of previous methods.
Contribution
We introduce a novel diffusion model with an efficient guidance network and feature embedding modules for identity-preserving facial editing from a single image.
Findings
Outperforms state-of-the-art in identity preservation
Achieves high photorealism in edited images
Effective control over pose, expression, and lighting
Abstract
Facial appearance editing is crucial for digital avatars, AR/VR, and personalized content creation, driving realistic user experiences. However, preserving identity with generative models is challenging, especially in scenarios with limited data availability. Traditional methods often require multiple images and still struggle with unnatural face shifts, inconsistent hair alignment, or excessive smoothing effects. To overcome these challenges, we introduce a novel diffusion-based framework, InstaFace, to generate realistic images while preserving identity using only a single image. Central to InstaFace, we introduce an efficient guidance network that harnesses 3D perspectives by integrating multiple 3DMM-based conditionals without introducing additional trainable parameters. Moreover, to ensure maximum identity retention as well as preservation of background, hair, and other contextual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
