Vista3D: Unravel the 3D Darkside of a Single Image

Qiuhong Shen; Xingyi Yang; Michael Bi Mi; Xinchao Wang

arXiv:2409.12193·cs.CV·September 19, 2024

TL;DR

Vista3D is a fast, two-phase framework that generates consistent 3D models from a single image by combining Gaussian Splatting, SDF optimization, and diffusion priors, achieving a balance between diversity and fidelity.

Contribution

The paper introduces Vista3D, a novel method that efficiently produces high-quality 3D reconstructions from a single image using a two-phase approach with disentangled implicit functions.

Findings

01. Generates 3D objects from a single image within 5 minutes.

02. Balances diversity and consistency in 3D generation.

03. Integrates 2D and 3D-aware diffusion priors.

Abstract

We embark on the age-old quest: unveiling the hidden dimensions of objects from mere glimpses of their visible parts. To address this, we present Vista3D, a framework that realizes swift and consistent 3D generation within a mere 5 minutes. At the heart of Vista3D lies a two-phase approach: the coarse phase and the fine phase. In the coarse phase, we rapidly generate initial geometry with Gaussian Splatting from a single image. In the fine phase, we extract a Signed Distance Function (SDF) directly from learned Gaussian Splatting, optimizing it with a differentiable isosurface representation. Furthermore, it elevates the quality of generation by using a disentangled representation with two independent implicit functions to capture both visible and obscured aspects of objects. Additionally, it harmonizes gradients from 2D diffusion prior with 3D-aware diffusion priors by angular…
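For readers unfamiliar with the Signed Distance Function (SDF) mentioned in the abstract, here is a minimal sketch of the general concept. It is not Vista3D's implementation: the sphere shape, the function names `sphere_sdf` and `find_surface_crossing`, and the bisection search are all illustrative assumptions. An SDF returns a negative value inside an object, a positive value outside, and zero on the surface, so the surface (the isosurface) is the set of points where the function crosses zero.

```python
import math


def sphere_sdf(p, center=(0.0, 0.0, 0.0), radius=1.0):
    """Signed distance from point p to a sphere (illustrative shape, not from the paper).

    Negative inside the sphere, zero on its surface, positive outside.
    """
    return math.dist(p, center) - radius


def find_surface_crossing(sdf, a, b, iters=40):
    """Locate the zero crossing of `sdf` on the segment a -> b by bisection.

    Returns None when the endpoints lie on the same side of the surface.
    """
    fa, fb = sdf(a), sdf(b)
    if fa * fb > 0:
        return None  # no sign change, so no surface crossing is bracketed
    for _ in range(iters):
        mid = tuple((x + y) / 2 for x, y in zip(a, b))
        fm = sdf(mid)
        if fa * fm <= 0:
            b, fb = mid, fm  # crossing lies between a and mid
        else:
            a, fa = mid, fm  # crossing lies between mid and b
    return tuple((x + y) / 2 for x, y in zip(a, b))


if __name__ == "__main__":
    inside, outside = (0.0, 0.0, 0.0), (2.0, 0.0, 0.0)
    print(sphere_sdf(inside), sphere_sdf(outside))
    print(find_surface_crossing(sphere_sdf, inside, outside))
```

In a learned setting the analytic `sphere_sdf` would be replaced by an optimized function, and a mesh would typically be extracted over a grid of cells rather than along a single segment; the sign-change test shown here is the same underlying idea.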

Code & Models

Repositories

florinshen/vista3d (Official)

Taxonomy

Topics: Computer Graphics and Visualization Techniques

Methods: Diffusion