Animatable and Relightable Gaussians for High-fidelity Human Avatar   Modeling

Zhe Li; Yipengjing Sun; Zerong Zheng; Lizhen Wang; Shengping Zhang,; Yebin Liu

arXiv:2311.16096·cs.CV·May 28, 2024·5 cites

Animatable and Relightable Gaussians for High-fidelity Human Avatar Modeling

Zhe Li, Yipengjing Sun, Zerong Zheng, Lizhen Wang, Shengping Zhang,, Yebin Liu

PDF

Open Access 1 Repo

TL;DR

This paper presents Animatable Gaussians, a novel avatar representation combining 2D CNNs, 3D Gaussian splatting, and physically-based rendering to produce high-fidelity, relightable, and dynamically detailed human avatars from RGB videos.

Contribution

It introduces a new Gaussian-based avatar model that integrates template-guided 2D parameterization, StyleGAN-based CNNs, and physical rendering for realistic, animatable, and relightable human avatars.

Findings

01

Outperforms state-of-the-art methods in avatar quality

02

Produces lifelike, dynamic, and relightable avatars

03

Effectively models loose clothing and complex appearances

Abstract

Modeling animatable human avatars from RGB videos is a long-standing and challenging problem. Recent works usually adopt MLP-based neural radiance fields (NeRF) to represent 3D humans, but it remains difficult for pure MLPs to regress pose-dependent garment details. To this end, we introduce Animatable Gaussians, a new avatar representation that leverages powerful 2D CNNs and 3D Gaussian splatting to create high-fidelity avatars. To associate 3D Gaussians with the animatable avatar, we learn a parametric template from the input videos, and then parameterize the template on two front & back canonical Gaussian maps where each pixel represents a 3D Gaussian. The learned template is adaptive to the wearing garments for modeling looser clothes like dresses. Such template-guided 2D parameterization enables us to employ a powerful StyleGAN-based CNN to learn the pose-dependent Gaussian maps…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lizhe00/animatablegaussians
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · 3D Shape Modeling and Analysis · Advanced Vision and Imaging