Neural Style-Preserving Visual Dubbing
Hyeongwoo Kim, Mohamed Elgharib, Michael Zollh\"ofer, Hans-Peter, Seidel, Thabo Beeler, Christian Richardt, Christian Theobalt

TL;DR
This paper introduces a style-preserving visual dubbing method that modifies facial expressions in target videos to match foreign languages while maintaining individual actor styles, using a recurrent GAN trained unsupervisedly.
Contribution
It presents a novel approach that captures and preserves actor-specific expression styles during dubbing, improving realism over previous methods.
Findings
Maintains target actor's style better than previous methods
Generates temporally coherent and photorealistic videos
Handles dynamic backgrounds effectively
Abstract
Dubbing is a technique for translating video content from one language to another. However, state-of-the-art visual dubbing techniques directly copy facial expressions from source to target actors without considering identity-specific idiosyncrasies such as a unique type of smile. We present a style-preserving visual dubbing approach from single video inputs, which maintains the signature style of target actors when modifying facial expressions, including mouth motions, to match foreign languages. At the heart of our approach is the concept of motion style, in particular for facial expressions, i.e., the person-specific expression change that is yet another essential factor beyond visual accuracy in face editing applications. Our method is based on a recurrent generative adversarial network that captures the spatiotemporal co-activation of facial expressions, and enables generating and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
