GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality
Taoran Yi, Jiemin Fang, Zanwei Zhou, Junjie Wang, Guanjun Wu, Lingxi Xie, Xiaopeng Zhang, Wenyu Liu, Xinggang Wang, Qi Tian

TL;DR
GaussianDreamerPro is a text-to-3D framework that improves the quality of generated 3D Gaussian assets by binding the Gaussians to geometry that evolves throughout generation, yielding detailed and manipulable 3D assets.
Contribution
It proposes a method that controls Gaussian growth during generation by binding the Gaussians to geometry, significantly improving quality over previous approaches.
Findings
Generated assets show significantly enhanced details and quality.
Assets can be seamlessly integrated into downstream manipulation pipelines.
The framework enables progressive enrichment of geometry and appearance.
Abstract
Recently, 3D Gaussian splatting (3D-GS) has achieved great success in reconstructing and rendering real-world scenes. To transfer the high rendering quality to generation tasks, a series of research works attempt to generate 3D-Gaussian assets from text. However, the generated assets have not achieved the same quality as those in reconstruction tasks. We observe that Gaussians tend to grow without control as the generation process may cause indeterminacy. Aiming at highly enhancing the generation quality, we propose a novel framework named GaussianDreamerPro. The main idea is to bind Gaussians to reasonable geometry, which evolves over the whole generation process. Along different stages of our framework, both the geometry and appearance can be enriched progressively. The final output asset is constructed with 3D Gaussians bound to mesh, which shows significantly enhanced details and…
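To make the idea of "Gaussians bound to mesh" concrete, here is a minimal sketch of one common way such a binding can be expressed: each Gaussian stores the index of a triangle and barycentric weights on it, so its center follows the mesh when the vertices move. This is an illustration under assumptions, not the paper's implementation; the abstract does not specify the binding scheme, and the function name `bound_gaussian_centers` and its parameters are hypothetical.

```python
import numpy as np

def bound_gaussian_centers(vertices, faces, face_ids, bary):
    """Compute centers of Gaussians attached to mesh triangles.

    Hypothetical helper (assumed binding scheme, not from the paper).
    vertices: (V, 3) float array of mesh vertex positions
    faces:    (F, 3) int array of vertex indices per triangle
    face_ids: (N,)   int array, the triangle each Gaussian is bound to
    bary:     (N, 3) float array of barycentric weights (rows sum to 1)
    """
    tri = vertices[faces[face_ids]]            # (N, 3, 3): corner positions per Gaussian
    return np.einsum("nk,nkd->nd", bary, tri)  # weighted sum of the three corners

# A single triangle with two Gaussians bound to it.
vertices = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
faces = np.array([[0, 1, 2]])
face_ids = np.array([0, 0])
bary = np.array([[1 / 3, 1 / 3, 1 / 3], [0.5, 0.5, 0.0]])

centers = bound_gaussian_centers(vertices, faces, face_ids, bary)

# Manipulating the mesh (here, a translation) moves the bound Gaussians with it.
offset = np.array([0.0, 0.0, 2.0])
moved = bound_gaussian_centers(vertices + offset, faces, face_ids, bary)
```

Because the centers are recomputed from the mesh, any edit applied to the vertices carries over to the Gaussians, which is one plausible reading of why mesh-bound assets suit downstream manipulation.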
Taxonomy
Topics: Image Processing and 3D Reconstruction · Computer Graphics and Visualization Techniques · Human Motion and Animation
