Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos

Alfredo Rivero; ShahRukh Athar; Zhixin Shu; Dimitris Samaras

arXiv:2402.03723·cs.CV·February 7, 2024·1 cites

Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos

Alfredo Rivero, ShahRukh Athar, Zhixin Shu, Dimitris Samaras

PDF

Open Access

TL;DR

Rig3DGS enables creation of controllable 3D human portraits from casual monocular videos by disentangling head movements and expressions using a novel deformation method guided by a 3D morphable model.

Contribution

The paper introduces a new deformation technique for 3D Gaussian representations that allows high-quality, controllable portrait rendering from single-view videos.

Findings

01

Effective control of facial expressions and head poses.

02

High-quality rendering comparable to multi-view methods.

03

Efficient training process demonstrated through experiments.

Abstract

Creating controllable 3D human portraits from casual smartphone videos is highly desirable due to their immense value in AR/VR applications. The recent development of 3D Gaussian Splatting (3DGS) has shown improvements in rendering quality and training efficiency. However, it still remains a challenge to accurately model and disentangle head movements and facial expressions from a single-view capture to achieve high-quality renderings. In this paper, we introduce Rig3DGS to address this challenge. We represent the entire scene, including the dynamic subject, using a set of 3D Gaussians in a canonical space. Using a set of control signals, such as head pose and expressions, we transform them to the 3D space with learned deformations to generate the desired rendering. Our key innovation is a carefully designed deformation method which is guided by a learnable prior derived from a 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Advanced Vision and Imaging · Augmented Reality Applications

MethodsSparse Evolutionary Training