PVP: Personalized Video Prior for Editable Dynamic Portraits using   StyleGAN

Kai-En Lin; Alex Trevithick; Keli Cheng; Michel Sarkis and; Mohsen Ghafoorian; Ning Bi; Gerhard Reitmayr; Ravi Ramamoorthi

arXiv:2306.17123·cs.CV·June 30, 2023

PVP: Personalized Video Prior for Editable Dynamic Portraits using StyleGAN

Kai-En Lin, Alex Trevithick, Keli Cheng, Michel Sarkis and, Mohsen Ghafoorian, Ning Bi, Gerhard Reitmayr, Ravi Ramamoorthi

PDF

Open Access

TL;DR

This paper introduces a method to create editable, dynamic 3D portraits from monocular videos, enabling novel viewpoints and expressions, by leveraging StyleGAN and personalized priors for improved pose handling and real-time performance.

Contribution

The work develops a personalized video prior using pivotal tuning inversion, allowing for extreme pose editing and expression manipulation in monocular portrait videos, surpassing previous methods.

Findings

01

Outperforms previous approaches on monocular video datasets.

02

Capable of real-time synthesis at 54 FPS on an RTX 3080.

03

Effectively disentangles pose and expression in the latent space.

Abstract

Portrait synthesis creates realistic digital avatars which enable users to interact with others in a compelling way. Recent advances in StyleGAN and its extensions have shown promising results in synthesizing photorealistic and accurate reconstruction of human faces. However, previous methods often focus on frontal face synthesis and most methods are not able to handle large head rotations due to the training data distribution of StyleGAN. In this work, our goal is to take as input a monocular video of a face, and create an editable dynamic portrait able to handle extreme head poses. The user can create novel viewpoints, edit the appearance, and animate the face. Our method utilizes pivotal tuning inversion (PTI) to learn a personalized video prior from a monocular video sequence. Then we can input pose and expression coefficients to MLPs and manipulate the latent vectors to synthesize…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis

MethodsConvolution · HuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Adaptive Instance Normalization · Dense Connections · Feedforward Network · StyleGAN · Focus