Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler, Karsten, Kreis

TL;DR
This paper introduces Align Your Gaussians (AYG), a novel method for synthesizing dynamic 4D scenes from text by combining multiview diffusion models with Gaussian splatting and deformation fields, achieving state-of-the-art results.
Contribution
AYG presents a new approach using dynamic 3D Gaussian Splatting with deformation fields for text-to-4D synthesis, incorporating regularization, motion amplification, and autoregressive schemes.
Findings
Outperforms previous methods qualitatively and quantitatively.
Enables seamless combination of different 4D animations.
Achieves state-of-the-art text-to-4D generation performance.
Abstract
Text-guided diffusion models have revolutionized image and video generation and have also been successfully used for optimization-based 3D object synthesis. Here, we instead focus on the underexplored text-to-4D setting and synthesize dynamic, animated 3D objects using score distillation methods with an additional temporal dimension. Compared to previous work, we pursue a novel compositional generation-based approach, and combine text-to-image, text-to-video, and 3D-aware multiview diffusion models to provide feedback during 4D object optimization, thereby simultaneously enforcing temporal consistency, high-quality visual appearance and realistic geometry. Our method, called Align Your Gaussians (AYG), leverages dynamic 3D Gaussian Splatting with deformation fields as 4D representation. Crucial to AYG is a novel method to regularize the distribution of the moving 3D Gaussians and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis
MethodsFocus · Diffusion · ALIGN
