InstaFace: Identity-Preserving Facial Editing with Single Image   Inference

MD Wahiduzzaman Khan; Mingshan Jia; Xiaolin Zhang; En Yu; Caifeng; Shan; Kaska Musial-Gabrys

arXiv:2502.20577·cs.CV·March 10, 2025

InstaFace: Identity-Preserving Facial Editing with Single Image Inference

MD Wahiduzzaman Khan, Mingshan Jia, Xiaolin Zhang, En Yu, Caifeng, Shan, Kaska Musial-Gabrys

PDF

TL;DR

InstaFace is a diffusion-based framework that enables realistic facial editing from a single image while effectively preserving identity and contextual features, addressing limitations of previous methods.

Contribution

We introduce a novel diffusion model with an efficient guidance network and feature embedding modules for identity-preserving facial editing from a single image.

Findings

01

Outperforms state-of-the-art in identity preservation

02

Achieves high photorealism in edited images

03

Effective control over pose, expression, and lighting

Abstract

Facial appearance editing is crucial for digital avatars, AR/VR, and personalized content creation, driving realistic user experiences. However, preserving identity with generative models is challenging, especially in scenarios with limited data availability. Traditional methods often require multiple images and still struggle with unnatural face shifts, inconsistent hair alignment, or excessive smoothing effects. To overcome these challenges, we introduce a novel diffusion-based framework, InstaFace, to generate realistic images while preserving identity using only a single image. Central to InstaFace, we introduce an efficient guidance network that harnesses 3D perspectives by integrating multiple 3DMM-based conditionals without introducing additional trainable parameters. Moreover, to ensure maximum identity retention as well as preservation of background, hair, and other contextual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.