Magic123: One Image to High-Quality 3D Object Generation Using Both 2D   and 3D Diffusion Priors

Guocheng Qian; Jinjie Mai; Abdullah Hamdi; Jian Ren; Aliaksandr; Siarohin; Bing Li; Hsin-Ying Lee; Ivan Skorokhodov; Peter Wonka; Sergey; Tulyakov; Bernard Ghanem

arXiv:2306.17843·cs.CV·July 25, 2023·75 cites

Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Guocheng Qian, Jinjie Mai, Abdullah Hamdi, Jian Ren, Aliaksandr, Siarohin, Bing Li, Hsin-Ying Lee, Ivan Skorokhodov, Peter Wonka, Sergey, Tulyakov, Bernard Ghanem

PDF

Open Access 1 Repo

TL;DR

Magic123 introduces a two-stage method that generates high-quality, textured 3D meshes from a single image by leveraging both 2D and 3D diffusion priors, improving over previous techniques.

Contribution

It proposes a novel coarse-to-fine approach combining 2D and 3D diffusion priors with a controllable trade-off parameter for better 3D object generation from images.

Findings

01

Significant improvement over previous image-to-3D methods.

02

Effective use of a single parameter to balance exploration and exploitation.

03

Validated on synthetic benchmarks and real-world images.

Abstract

We present Magic123, a two-stage coarse-to-fine approach for high-quality, textured 3D meshes generation from a single unposed image in the wild using both2D and 3D priors. In the first stage, we optimize a neural radiance field to produce a coarse geometry. In the second stage, we adopt a memory-efficient differentiable mesh representation to yield a high-resolution mesh with a visually appealing texture. In both stages, the 3D content is learned through reference view supervision and novel views guided by a combination of 2D and 3D diffusion priors. We introduce a single trade-off parameter between the 2D and 3D priors to control exploration (more imaginative) and exploitation (more precise) of the generated geometry. Additionally, we employ textual inversion and monocular depth regularization to encourage consistent appearances across views and to prevent degenerate solutions,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

guochengqian/magic123
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis

MethodsDiffusion