GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
Gyumin Shim, Sangmin Lee, Jaegul Choo

TL;DR
GaussianMotion is a new human rendering approach that creates fully animatable 3D avatars from text, combining Gaussian Splatting with pose-guided learning for high fidelity and efficiency.
Contribution
It introduces a novel deformable Gaussian Splatting model with end-to-end text-to-3D training and adaptive score distillation for animatable human avatars.
Findings
Outperforms existing methods in quality and diversity.
Produces high-fidelity static and animated 3D human models.
Efficiently generates diverse avatars from textual descriptions.
Abstract
In this paper, we introduce GaussianMotion, a novel human rendering model that generates fully animatable scenes aligned with textual descriptions using Gaussian Splatting. Although existing methods achieve reasonable text-to-3D generation of human bodies using various 3D representations, they often face limitations in fidelity and efficiency, or primarily focus on static models with limited pose control. In contrast, our method generates fully animatable 3D avatars by combining deformable 3D Gaussian Splatting with text-to-3D score distillation, achieving high fidelity and efficient rendering for arbitrary poses. By densely generating diverse random poses during optimization, our deformable 3D human model learns to capture a wide range of natural motions distilled from a pose-conditioned diffusion model in an end-to-end manner. Furthermore, we propose Adaptive Score Distillation that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Artificial Intelligence in Games
MethodsDiffusion · Focus
