Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model

Ruoxi Shi; Hansheng Chen; Zhuoyang Zhang; Minghua Liu; Chao Xu; Xinyue; Wei; Linghao Chen; Chong Zeng; Hao Su

arXiv:2310.15110·cs.CV·October 24, 2023·35 cites

Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model

Ruoxi Shi, Hansheng Chen, Zhuoyang Zhang, Minghua Liu, Chao Xu, Xinyue, Wei, Linghao Chen, Chong Zeng, Hao Su

PDF

Open Access 1 Repo

TL;DR

Zero123++ is a diffusion-based model that generates 3D-consistent multi-view images from a single image, leveraging pretrained 2D priors and minimal fine-tuning to improve quality and control.

Contribution

It introduces a novel conditioning and training scheme for single-image to multi-view generation, enabling high-quality, consistent outputs with minimal fine-tuning.

Findings

01

Produces high-quality, 3D-consistent multi-view images from a single input.

02

Overcomes texture degradation and geometric misalignment issues.

03

Enables training of ControlNet for enhanced generation control.

Abstract

We report Zero123++, an image-conditioned diffusion model for generating 3D-consistent multi-view images from a single input view. To take full advantage of pretrained 2D generative priors, we develop various conditioning and training schemes to minimize the effort of finetuning from off-the-shelf image diffusion models such as Stable Diffusion. Zero123++ excels in producing high-quality, consistent multi-view images from a single image, overcoming common issues like texture degradation and geometric misalignment. Furthermore, we showcase the feasibility of training a ControlNet on Zero123++ for enhanced control over the generation process. The code is available at https://github.com/SUDO-AI-3D/zero123plus.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sudo-ai-3d/zero123plus
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Advanced Image Processing Techniques

MethodsDiffusion