PlacidDreamer: Advancing Harmony in Text-to-3D Generation
Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan, Zhang, Pengfei Wan, Di Zhang, Jia Jia

TL;DR
PlacidDreamer introduces a unified text-to-3D generation framework that harmonizes multi-view and text-conditioned generation using a single diffusion model, and employs a novel score distillation method to balance detail richness and saturation.
Contribution
It proposes the Latent-Plane module for unified multi-view diffusion and a Balanced Score Distillation algorithm to address saturation issues in text-to-3D generation.
Findings
Outperforms previous methods in generating diverse and detailed 3D assets.
Effectively balances detail richness and saturation in generated 3D models.
Validated through extensive experiments demonstrating superior quality.
Abstract
Recently, text-to-3D generation has attracted significant attention, resulting in notable performance enhancements. Previous methods utilize end-to-end 3D generation models to initialize 3D Gaussians, multi-view diffusion models to enforce multi-view consistency, and text-to-image diffusion models to refine details with score distillation algorithms. However, these methods exhibit two limitations. Firstly, they encounter conflicts in generation directions since different models aim to produce diverse 3D assets. Secondly, the issue of over-saturation in score distillation has not been thoroughly investigated and solved. To address these limitations, we propose PlacidDreamer, a text-to-3D framework that harmonizes initialization, multi-view generation, and text-conditioned generation with a single multi-view diffusion model, while simultaneously employing a novel score distillation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Image Processing and 3D Reconstruction
MethodsDiffusion
