IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion

Tharun Anand; Aryan Garg; Kaushik Mitra

arXiv:2501.07530·cs.CV·January 14, 2025

IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion

Tharun Anand, Aryan Garg, Kaushik Mitra

PDF

TL;DR

IP-FaceDiff introduces a novel facial video editing method that leverages pre-trained diffusion models for high-quality, identity-preserving, and efficient text-driven edits across diverse scenarios.

Contribution

The paper presents a fine-tuning scheme for pre-trained diffusion models enabling localized, high-quality, identity-preserving facial video editing with significantly reduced editing time.

Findings

01

80% reduction in editing time

02

Superior identity preservation across frames

03

Effective across diverse facial expressions and poses

Abstract

Facial video editing has become increasingly important for content creators, enabling the manipulation of facial expressions and attributes. However, existing models encounter challenges such as poor editing quality, high computational costs and difficulties in preserving facial identity across diverse edits. Additionally, these models are often constrained to editing predefined facial attributes, limiting their flexibility to diverse editing prompts. To address these challenges, we propose a novel facial video editing framework that leverages the rich latent space of pre-trained text-to-image (T2I) diffusion models and fine-tune them specifically for facial video editing tasks. Our approach introduces a targeted fine-tuning scheme that enables high quality, localized, text-driven edits while ensuring identity preservation across video frames. Additionally, by using pre-trained T2I…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion · Sparse Evolutionary Training