DreamFusion: Text-to-3D using 2D Diffusion

Ben Poole; Ajay Jain; Jonathan T. Barron; Ben Mildenhall

arXiv:2209.14988·cs.CV·September 30, 2022·463 cites

DreamFusion: Text-to-3D using 2D Diffusion

Ben Poole, Ajay Jain, Jonathan T. Barron, Ben Mildenhall

PDF

Open Access 4 Repos 1 Video

TL;DR

DreamFusion leverages a pretrained 2D diffusion model to generate 3D models from text prompts without requiring 3D training data, by optimizing Neural Radiance Fields through a novel probability density distillation loss.

Contribution

This work introduces a method to synthesize 3D models from text using only 2D diffusion models as priors, eliminating the need for large-scale 3D datasets.

Findings

01

Produces view-consistent 3D models from text prompts

02

No 3D training data or modifications to diffusion models needed

03

Enables relighting and compositing of generated 3D objects

Abstract

Recent breakthroughs in text-to-image synthesis have been driven by diffusion models trained on billions of image-text pairs. Adapting this approach to 3D synthesis would require large-scale datasets of labeled 3D data and efficient architectures for denoising 3D data, neither of which currently exist. In this work, we circumvent these limitations by using a pretrained 2D text-to-image diffusion model to perform text-to-3D synthesis. We introduce a loss based on probability density distillation that enables the use of a 2D diffusion model as a prior for optimization of a parametric image generator. Using this loss in a DeepDream-like procedure, we optimize a randomly-initialized 3D model (a Neural Radiance Field, or NeRF) via gradient descent such that its 2D renderings from random angles achieve a low loss. The resulting 3D model of the given text can be viewed from any angle, relit by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

DreamFusion: Text-to-3D using 2D Diffusion· slideslive

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Advanced Image and Video Retrieval Techniques · Image Processing and 3D Reconstruction

MethodsDiffusion