Points-to-3D: Bridging the Gap between Sparse Points and   Shape-Controllable Text-to-3D Generation

Chaohui Yu; Qiang Zhou; Jingliang Li; Zhe Zhang; Zhibin Wang; Fan Wang

arXiv:2307.13908·cs.CV·July 27, 2023·1 cites

Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation

Chaohui Yu, Qiang Zhou, Jingliang Li, Zhe Zhang, Zhibin Wang, Fan Wang

PDF

Open Access

TL;DR

Points-to-3D introduces a novel framework that combines sparse 3D points from diffusion models with score distillation to enhance shape controllability and view consistency in text-to-3D generation.

Contribution

It proposes a flexible method that uses sparse 3D points as geometric priors and combines 2D and 3D diffusion models to improve realism and controllability in text-to-3D synthesis.

Findings

01

Improves view consistency in generated 3D models

02

Achieves better shape controllability compared to existing methods

03

Demonstrates superior qualitative and quantitative results

Abstract

Text-to-3D generation has recently garnered significant attention, fueled by 2D diffusion models trained on billions of image-text pairs. Existing methods primarily rely on score distillation to leverage the 2D diffusion priors to supervise the generation of 3D models, e.g., NeRF. However, score distillation is prone to suffer the view inconsistency problem, and implicit NeRF modeling can also lead to an arbitrary shape, thus leading to less realistic and uncontrollable 3D generation. In this work, we propose a flexible framework of Points-to-3D to bridge the gap between sparse yet freely available 3D points and realistic shape-controllable 3D generation by distilling the knowledge from both 2D and 3D diffusion models. The core idea of Points-to-3D is to introduce controllable sparse 3D points to guide the text-to-3D generation. Specifically, we use the sparse point cloud generated from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques

MethodsDiffusion · ALIGN