Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models
Sanoojan Baliah, Qinliang Lin, Shengcai Liao, Xiaodan Liang, Muhammad, Haris Khan

TL;DR
This paper introduces a unified diffusion-based face swapping method that improves realism and fidelity by framing the task as a self-supervised inpainting problem, utilizing CLIP features, and enabling head and accessory swapping.
Contribution
A novel diffusion model approach for face swapping that incorporates self-supervised inpainting, CLIP feature disentanglement, and mask shuffling for a universal, head-swapping capable model.
Findings
High-fidelity, realistic face swapping demonstrated on FFHQ and CelebA datasets.
Robustness to pose variation, occlusion, and color differences.
Efficient inference with minimal artifacts.
Abstract
Despite promising progress in face swapping task, realistic swapped images remain elusive, often marred by artifacts, particularly in scenarios involving high pose variation, color differences, and occlusion. To address these issues, we propose a novel approach that better harnesses diffusion models for face-swapping by making following core contributions. (a) We propose to re-frame the face-swapping task as a self-supervised, train-time inpainting problem, enhancing the identity transfer while blending with the target image. (b) We introduce a multi-step Denoising Diffusion Implicit Model (DDIM) sampling during training, reinforcing identity and perceptual similarities. (c) Third, we introduce CLIP feature disentanglement to extract pose, expression, and lighting information from the target image, improving fidelity. (d) Further, we introduce a mask shuffling technique during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Asian Geopolitics and Ethnography · China's Ethnic Minorities and Relations
MethodsDiffusion · Inpainting · Contrastive Language-Image Pre-training
