Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention

Jianjin Xu; Saman Motamed; Praneetha Vaddamanu; Chen Henry Wu; Christian Haene; Jean-Charles Bazin; Fernando de la Torre

arXiv:2312.03556 · cs.CV · December 7, 2023 · 1 citation

TL;DR

This paper introduces Parallel Visual Attention (PVA) with diffusion models for face inpainting, achieving superior identity preservation, user-controlled attributes, and faster inference with minimal fine-tuning.

Contribution

The paper proposes PVA integrated into diffusion models, enabling efficient, identity-preserving face inpainting with semantic control and reduced computational requirements.

Findings

01 PVA outperforms baselines such as MyStyle and Custom Diffusion in identity preservation.

02 PVA achieves over 20x faster fine-tuning for new identities.

03 PVA provides effective language-guided face inpainting.

Abstract

Face inpainting is important in various applications, such as photo restoration, image editing, and virtual reality. Despite the significant advances in face generative models, ensuring that a person's unique facial identity is maintained during the inpainting process is still an elusive goal. Current state-of-the-art techniques, exemplified by MyStyle, necessitate resource-intensive fine-tuning and a substantial number of images for each new identity. Furthermore, existing methods often fall short in accommodating user-specified semantic attributes, such as beard or expression. To improve inpainting results, and reduce the computational complexity during inference, this paper proposes the use of Parallel Visual Attention (PVA) in conjunction with diffusion models. Specifically, we insert parallel attention matrices to each cross-attention module in the denoising network, which attends…
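The idea of adding a parallel attention branch to a cross-attention module can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract above is truncated before it says what the parallel branch attends to and how its output is merged. The sketch assumes the second branch attends to a set of visual feature tokens (suggested only by the method's name) and that the two branch outputs are summed. All names (`ParallelCrossAttention`, `visual_scale`, the weight attributes) are hypothetical, and plain Python lists stand in for tensors.

```python
import math
import random


def matmul(a, b):
    """Multiply an n x m matrix by an m x p matrix (lists of lists)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def softmax_rows(m):
    """Numerically stable softmax applied to each row."""
    out = []
    for row in m:
        peak = max(row)
        exps = [math.exp(v - peak) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def attention(q, k, v):
    """Scaled dot-product attention: softmax(q k^T / sqrt(d)) v."""
    d = len(q[0])
    scores = matmul(q, transpose(k))
    scores = [[s / math.sqrt(d) for s in row] for row in scores]
    return matmul(softmax_rows(scores), v)


def rand_matrix(rows, cols, rng):
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


class ParallelCrossAttention:
    """Cross-attention with an extra key/value branch (hypothetical sketch).

    The text branch mirrors ordinary cross-attention. The parallel branch
    has its own key/value projections and reuses the same queries.
    """

    def __init__(self, d_model, d_text, d_visual, seed=0):
        rng = random.Random(seed)
        self.w_q = rand_matrix(d_model, d_model, rng)
        # Original branch: keys/values from text tokens.
        self.w_k_text = rand_matrix(d_text, d_model, rng)
        self.w_v_text = rand_matrix(d_text, d_model, rng)
        # Parallel branch: keys/values from visual tokens (assumption).
        self.w_k_vis = rand_matrix(d_visual, d_model, rng)
        self.w_v_vis = rand_matrix(d_visual, d_model, rng)

    def __call__(self, hidden, text_tokens, visual_tokens, visual_scale=1.0):
        q = matmul(hidden, self.w_q)
        text_out = attention(
            q, matmul(text_tokens, self.w_k_text), matmul(text_tokens, self.w_v_text)
        )
        vis_out = attention(
            q, matmul(visual_tokens, self.w_k_vis), matmul(visual_tokens, self.w_v_vis)
        )
        # Assumption: the two branch outputs are merged by a weighted sum.
        return [
            [t + visual_scale * p for t, p in zip(t_row, p_row)]
            for t_row, p_row in zip(text_out, vis_out)
        ]


if __name__ == "__main__":
    rng = random.Random(1)
    layer = ParallelCrossAttention(d_model=4, d_text=3, d_visual=5)
    hidden = rand_matrix(6, 4, rng)   # 6 latent positions
    text = rand_matrix(2, 3, rng)     # 2 text tokens
    visual = rand_matrix(7, 5, rng)   # 7 visual feature tokens
    out = layer(hidden, text, visual)
    print(len(out), len(out[0]))
```

Setting `visual_scale=0.0` in this sketch reduces the layer to ordinary text cross-attention, which shows why a parallel branch can be attached to an existing module without changing its original behaviour.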

Taxonomy

Topics: Face recognition and analysis · Generative Adversarial Networks and Image Synthesis · Facial Nerve Paralysis Treatment and Research

Methods: Inpainting · Diffusion · Concatenated Skip Connection · Softmax · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings