DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior
Tianyu Huang, Yihan Zeng, Zhilu Zhang, Wan Xu, Hang Xu, Songcen Xu,, Rynson W. H. Lau, Wangmeng Zuo

TL;DR
DreamControl introduces a two-stage framework for controllable text-to-3D generation that addresses geometry inconsistency issues by leveraging 3D self-prior and control-based score distillation, resulting in high-quality 3D content.
Contribution
It proposes a novel two-stage 2D-lifting framework with adaptive viewpoint sampling and boundary integrity metrics to improve 3D generation quality and consistency.
Findings
Achieves high-quality 3D content with consistent geometry and textures.
Effective in downstream tasks like user-guided generation and 3D animation.
Addresses Janus problem in text-to-3D generation.
Abstract
3D generation has raised great attention in recent years. With the success of text-to-image diffusion models, the 2D-lifting technique becomes a promising route to controllable 3D generation. However, these methods tend to present inconsistent geometry, which is also known as the Janus problem. We observe that the problem is caused mainly by two aspects, i.e., viewpoint bias in 2D diffusion models and overfitting of the optimization objective. To address it, we propose a two-stage 2D-lifting framework, namely DreamControl, which optimizes coarse NeRF scenes as 3D self-prior and then generates fine-grained objects with control-based score distillation. Specifically, adaptive viewpoint sampling and boundary integrity metric are proposed to ensure the consistency of generated priors. The priors are then regarded as input conditions to maintain reasonable geometries, in which conditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis
MethodsDiffusion
