Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor
Anchit Gupta, Faizan Farooq Khan, Rudrabha Mukhopadhyay, Vinay P., Namboodiri, C. V. Jawahar

TL;DR
This paper introduces an advanced video editing tool that integrates modern talking face generation algorithms, enabling interactive lip-syncing, facial re-enactment, and automatic translation features to enhance synthetic video creation.
Contribution
It presents a user-friendly video editor that combines state-of-the-art facial video editing algorithms with manual controls for improved editing and synthesis quality.
Findings
Human evaluations show increased editing efficiency.
Improved quality of synthetic talking face videos.
Effective synchronization of background content with translated speech.
Abstract
This paper proposes a video editor based on OpenShot with several state-of-the-art facial video editing algorithms as added functionalities. Our editor provides an easy-to-use interface to apply modern lip-syncing algorithms interactively. Apart from lip-syncing, the editor also uses audio and facial re-enactment to generate expressive talking faces. The manual control improves the overall experience of video editing without missing out on the benefits of modern synthetic video generation algorithms. This control enables us to lip-sync complex dubbed movie scenes, interviews, television shows, and other visual content. Furthermore, our editor provides features that automatically translate lectures from spoken content, lip-sync of the professor, and background content like slides. While doing so, we also tackle the critical aspect of synchronizing background content with the translated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
