GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
Baorui Ma, Haoge Deng, Junsheng Zhou, Yu-Shen Liu, Tiejun Huang,, Xinlong Wang

TL;DR
GeoDream introduces a method combining 3D priors with 2D diffusion models to improve 3D consistency and realism in text-to-3D generation, addressing geometric artifacts and inconsistency issues.
Contribution
The paper proposes a novel approach that integrates explicit 3D priors with 2D diffusion models, enhancing 3D geometric consistency and fidelity in generated 3D objects.
Findings
Produces more 3D consistent textured meshes
Generates high-resolution realistic renderings (1024x1024)
Achieves better semantic coherence in 3D generation
Abstract
Text-to-3D generation by distilling pretrained large-scale text-to-image diffusion models has shown great promise but still suffers from inconsistent 3D geometric structures (Janus problems) and severe artifacts. The aforementioned problems mainly stem from 2D diffusion models lacking 3D awareness during the lifting. In this work, we present GeoDream, a novel method that incorporates explicit generalized 3D priors with 2D diffusion priors to enhance the capability of obtaining unambiguous 3D consistent geometric structures without sacrificing diversity or fidelity. Specifically, we first utilize a multi-view diffusion model to generate posed images and then construct cost volume from the predicted image, which serves as native 3D geometric priors, ensuring spatial consistency in 3D space. Subsequently, we further propose to harness 3D geometric priors to unlock the great potential of 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis
MethodsDiffusion
