PerformRecast: Expression and Head Pose Disentanglement for Portrait Video Editing
Jiadong Liang, Bojun Xiong, Jie Tian, Hua Li, Xiao Long, Yong Zheng, Huan Fu

TL;DR
PerformRecast is a novel portrait video editing method that disentangles facial expression from head pose, enabling independent expression editing with high quality and control, based on improved 3D Morphable Face Model techniques.
Contribution
The paper introduces a new expression-only editing approach that improves disentanglement and control by enhancing 3DMM-based keypoint transformation and decoupling facial regions.
Findings
Outperforms existing methods in controllability and efficiency.
Produces more faithful and high-quality edited videos.
Enables fine-grained expression control independent of head pose.
Abstract
This paper primarily investigates the task of expression-only portrait video performance editing based on a driving video, which plays a crucial role in animation and film industries. Most existing research mainly focuses on portrait animation, which aims to animate a static portrait image according to the facial motion from the driving video. As a consequence, it remains challenging for them to disentangle the facial expression from head pose rotation and thus lack the ability to edit facial expression independently. In this paper, we propose PerformRecast, a versatile expression-only video editing method which is dedicated to recast the performance in existing film and animation. The key insight of our method comes from the characteristics of 3D Morphable Face Model (3DMM), which models the face identity, facial expression and head pose of 3D face mesh with separate parameters.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Emotion and Mood Recognition
