Retrieval-Augmented Score Distillation for Text-to-3D Generation
Junyoung Seo, Susung Hong, Wooseok Jang, In\`es Hyeonsu Kim, Minseop, Kwak, Doyup Lee, Seungryong Kim

TL;DR
ReDream introduces a retrieval-augmented method for text-to-3D generation that enhances geometric consistency and scene fidelity by leveraging semantically relevant assets during the diffusion process.
Contribution
It proposes a novel retrieval-based framework that incorporates geometric priors from relevant assets to improve 3D generation quality and consistency.
Findings
Significant improvements in geometry and fidelity of generated 3D scenes.
ReDream outperforms existing methods in quality and consistency.
Extensive experiments validate the effectiveness of the retrieval-augmented approach.
Abstract
Text-to-3D generation has achieved significant success by incorporating powerful 2D diffusion models, but insufficient 3D prior knowledge also leads to the inconsistency of 3D geometry. Recently, since large-scale multi-view datasets have been released, fine-tuning the diffusion model on the multi-view datasets becomes a mainstream to solve the 3D inconsistency problem. However, it has confronted with fundamental difficulties regarding the limited quality and diversity of 3D data, compared with 2D data. To sidestep these trade-offs, we explore a retrieval-augmented approach tailored for score distillation, dubbed ReDream. We postulate that both expressiveness of 2D diffusion models and geometric consistency of 3D assets can be fully leveraged by employing the semantically relevant assets directly within the optimization process. To this end, we introduce novel framework for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Handwritten Text Recognition Techniques · Image Processing and 3D Reconstruction
MethodsDiffusion
