Towards Consistent and Controllable Image Synthesis for Face Editing

Mengting Wei; Tuomas Varanka; Yante Li; Xingxun Jiang; Huai-Qian Khor; Guoying Zhao

arXiv:2502.02465·cs.CV·November 27, 2025

Towards Consistent and Controllable Image Synthesis for Face Editing

Mengting Wei, Tuomas Varanka, Yante Li, Xingxun Jiang, Huai-Qian Khor, Guoying Zhao

PDF

Open Access 1 Models

TL;DR

This paper introduces RigFace, a novel face editing method that leverages Stable-Diffusion and 3D face models to achieve consistent, controllable, and high-quality face edits by disentangling background, pose, expression, and lighting controls.

Contribution

The paper presents a new approach combining a Spatial Attribute Encoder, FaceFusion, and Attribute Rigger to improve control and consistency in face editing with diffusion models.

Findings

01

Achieves superior identity preservation and photorealism.

02

Effectively disentangles control of background, pose, expression, and lighting.

03

Outperforms existing face editing methods in key metrics.

Abstract

Face editing methods, essential for tasks like virtual avatars, digital human synthesis and identity preservation, have traditionally been built upon GAN-based techniques, while recent focus has shifted to diffusion-based models due to their success in image reconstruction. However, diffusion models still face challenges in controlling specific attributes and preserving the consistency of other unchanged attributes especially the identity characteristics. To address these issues and facilitate more convenient editing of face images, we propose a novel approach that leverages the power of Stable-Diffusion (SD) models and crude 3D face models to control the lighting, facial expression and head pose of a portrait photo. We observe that this task essentially involves the combinations of target background, identity and face attributes aimed to edit. We strive to sufficiently disentangle the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
mengtingwei/rigface
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Advanced Image and Video Retrieval Techniques · Face and Expression Recognition

MethodsDiffusion · Focus