PlacidDreamer: Advancing Harmony in Text-to-3D Generation

Shuo Huang; Shikun Sun; Zixuan Wang; Xiaoyu Qin; Yanmin Xiong; Yuan; Zhang; Pengfei Wan; Di Zhang; Jia Jia

arXiv:2407.13976·cs.CV·July 22, 2024

PlacidDreamer: Advancing Harmony in Text-to-3D Generation

Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan, Zhang, Pengfei Wan, Di Zhang, Jia Jia

PDF

Open Access 1 Repo

TL;DR

PlacidDreamer introduces a unified text-to-3D generation framework that harmonizes multi-view and text-conditioned generation using a single diffusion model, and employs a novel score distillation method to balance detail richness and saturation.

Contribution

It proposes the Latent-Plane module for unified multi-view diffusion and a Balanced Score Distillation algorithm to address saturation issues in text-to-3D generation.

Findings

01

Outperforms previous methods in generating diverse and detailed 3D assets.

02

Effectively balances detail richness and saturation in generated 3D models.

03

Validated through extensive experiments demonstrating superior quality.

Abstract

Recently, text-to-3D generation has attracted significant attention, resulting in notable performance enhancements. Previous methods utilize end-to-end 3D generation models to initialize 3D Gaussians, multi-view diffusion models to enforce multi-view consistency, and text-to-image diffusion models to refine details with score distillation algorithms. However, these methods exhibit two limitations. Firstly, they encounter conflicts in generation directions since different models aim to produce diverse 3D assets. Secondly, the issue of over-saturation in score distillation has not been thoroughly investigated and solved. To address these limitations, we propose PlacidDreamer, a text-to-3D framework that harmonizes initialization, multi-view generation, and text-conditioned generation with a single multi-view diffusion model, while simultaneously employing a novel score distillation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hansenhuang0823/placiddreamer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Image Processing and 3D Reconstruction

MethodsDiffusion