Neural Style-Preserving Visual Dubbing

Hyeongwoo Kim; Mohamed Elgharib; Michael Zollh\"ofer; Hans-Peter; Seidel; Thabo Beeler; Christian Richardt; Christian Theobalt

arXiv:1909.02518·cs.CV·September 9, 2019

Neural Style-Preserving Visual Dubbing

Hyeongwoo Kim, Mohamed Elgharib, Michael Zollh\"ofer, Hans-Peter, Seidel, Thabo Beeler, Christian Richardt, Christian Theobalt

PDF

TL;DR

This paper introduces a style-preserving visual dubbing method that modifies facial expressions in target videos to match foreign languages while maintaining individual actor styles, using a recurrent GAN trained unsupervisedly.

Contribution

It presents a novel approach that captures and preserves actor-specific expression styles during dubbing, improving realism over previous methods.

Findings

01

Maintains target actor's style better than previous methods

02

Generates temporally coherent and photorealistic videos

03

Handles dynamic backgrounds effectively

Abstract

Dubbing is a technique for translating video content from one language to another. However, state-of-the-art visual dubbing techniques directly copy facial expressions from source to target actors without considering identity-specific idiosyncrasies such as a unique type of smile. We present a style-preserving visual dubbing approach from single video inputs, which maintains the signature style of target actors when modifying facial expressions, including mouth motions, to match foreign languages. At the heart of our approach is the concept of motion style, in particular for facial expressions, i.e., the person-specific expression change that is yet another essential factor beyond visual accuracy in face editing applications. Our method is based on a recurrent generative adversarial network that captures the spatiotemporal co-activation of facial expressions, and enables generating and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.