3D$^2$-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic   Gaussian Avatar Modeling

Zichen Tang; Hongyu Yang; Hanchen Zhang; Jiaxin Chen; Di Huang

arXiv:2412.11599·cs.CV·December 17, 2024

3D$^2$-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling

Zichen Tang, Hongyu Yang, Hanchen Zhang, Jiaxin Chen, Di Huang

PDF

Open Access 1 Repo 1 Video

TL;DR

3D$^2$-Actor introduces a pose-conditioned, 3D-aware denoising approach that enhances the realism and pose generalization of 3D human avatars from sparse multi-view RGB videos.

Contribution

It presents a novel pipeline combining 2D denoising guided by pose cues with a Gaussian-based 3D rectifier for improved 3D avatar modeling and animation.

Findings

01

Achieves high-fidelity 3D avatar reconstruction.

02

Generalizes effectively to novel poses.

03

Ensures smooth temporal continuity in video synthesis.

Abstract

Advancements in neural implicit representations and differentiable rendering have markedly improved the ability to learn animatable 3D avatars from sparse multi-view RGB videos. However, current methods that map observation space to canonical space often face challenges in capturing pose-dependent details and generalizing to novel poses. While diffusion models have demonstrated remarkable zero-shot capabilities in 2D image generation, their potential for creating animatable 3D avatars from 2D inputs remains underexplored. In this work, we introduce 3D $^{2}$ -Actor, a novel approach featuring a pose-conditioned 3D-aware human modeling pipeline that integrates iterative 2D denoising and 3D rectifying steps. The 2D denoiser, guided by pose cues, generates detailed multi-view images that provide the rich feature set necessary for high-fidelity 3D reconstruction and pose rendering.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

silence-tang/gaussianactor
pytorchOfficial

Videos

3D$^2$-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling· underline

Taxonomy

TopicsHuman Pose and Action Recognition · Human Motion and Animation · 3D Shape Modeling and Analysis

MethodsSparse Evolutionary Training · Diffusion