Zero-1-to-3: Zero-shot One Image to 3D Object

Ruoshi Liu; Rundi Wu; Basile Van Hoorick; Pavel Tokmakov; Sergey; Zakharov; Carl Vondrick

arXiv:2303.11328·cs.CV·March 21, 2023·21 cites

Zero-1-to-3: Zero-shot One Image to 3D Object

Ruoshi Liu, Rundi Wu, Basile Van Hoorick, Pavel Tokmakov, Sergey, Zakharov, Carl Vondrick

PDF

Open Access 1 Repo 7 Models 2 Datasets 1 Video

TL;DR

Zero-1-to-3 introduces a diffusion-based framework that synthesizes novel object views from a single image by leveraging geometric priors learned from large-scale models, enabling effective 3D reconstruction and view synthesis.

Contribution

It presents a novel viewpoint-conditioned diffusion model trained on synthetic data that generalizes zero-shot to real-world images for 3D object view synthesis.

Findings

01

Outperforms existing single-view 3D reconstruction models

02

Retains strong zero-shot generalization to in-the-wild images

03

Effectively synthesizes novel views from a single image

Abstract

We introduce Zero-1-to-3, a framework for changing the camera viewpoint of an object given just a single RGB image. To perform novel view synthesis in this under-constrained setting, we capitalize on the geometric priors that large-scale diffusion models learn about natural images. Our conditional diffusion model uses a synthetic dataset to learn controls of the relative camera viewpoint, which allow new images to be generated of the same object under a specified camera transformation. Even though it is trained on a synthetic dataset, our model retains a strong zero-shot generalization ability to out-of-distribution datasets as well as in-the-wild images, including impressionist paintings. Our viewpoint-conditioned diffusion approach can further be used for the task of 3D reconstruction from a single image. Qualitative and quantitative experiments show that our method significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cvlab-columbia/zero123
pytorchOfficial

Models

Datasets

Videos

Zero-1-to-3: Zero-shot One Image to 3D Object· youtube

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis

MethodsDiffusion