RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D
Lingteng Qiu, Guanying Chen, Xiaodong Gu, Qi Zuo, Mutian Xu, Yushuang, Wu, Weihao Yuan, Zilong Dong, Liefeng Bo, Xiaoguang Han

TL;DR
RichDreamer introduces a generalizable Normal-Depth diffusion model trained on large-scale data, significantly improving detail richness in text-to-3D generation by better modeling scene geometry and appearance.
Contribution
It proposes a novel Normal-Depth diffusion model trained on large datasets, enhancing 3D generation quality and detail over previous 2D diffusion-based methods.
Findings
Enhanced detail richness in generated 3D models.
Achieved state-of-the-art results in text-to-3D tasks.
Improved stability by using normal and depth priors.
Abstract
Lifting 2D diffusion for 3D generation is a challenging problem due to the lack of geometric prior and the complex entanglement of materials and lighting in natural images. Existing methods have shown promise by first creating the geometry through score-distillation sampling (SDS) applied to rendered surface normals, followed by appearance modeling. However, relying on a 2D RGB diffusion model to optimize surface normals is suboptimal due to the distribution discrepancy between natural images and normals maps, leading to instability in optimization. In this paper, recognizing that the normal and depth information effectively describe scene geometry and be automatically estimated from images, we propose to learn a generalizable Normal-Depth diffusion model for 3D generation. We achieve this by training on the large-scale LAION dataset together with the generalizable image-to-depth and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Advanced Image and Video Retrieval Techniques
MethodsDiffusion
