Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video   Diffusion Models

Haibo Yang; Yang Chen; Yingwei Pan; Ting Yao; Zhineng Chen; and Chong-Wah Ngo; Tao Mei

arXiv:2409.07452·cs.CV·September 12, 2024

Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models

Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, and Chong-Wah Ngo, Tao Mei

PDF

Open Access 1 Repo 1 Models

TL;DR

Hi3D introduces a novel video diffusion-based approach for high-resolution 3D generation from a single image, achieving superior multi-view consistency and detailed textures through 3D-aware modeling and reconstruction.

Contribution

The paper presents Hi3D, a new paradigm combining video diffusion models with 3D-aware priors and Gaussian Splatting for high-resolution 3D object generation from a single image.

Findings

01

Produces high-fidelity, multi-view consistent images with detailed textures.

02

Outperforms existing methods in novel view synthesis and single view reconstruction.

03

Demonstrates effective 3D reconstruction from high-resolution multi-view images.

Abstract

Despite having tremendous progress in image-to-3D generation, existing methods still struggle to produce multi-view consistent images with high-resolution textures in detail, especially in the paradigm of 2D diffusion that lacks 3D awareness. In this work, we present High-resolution Image-to-3D model (Hi3D), a new video diffusion based paradigm that redefines a single image to multi-view images as 3D-aware sequential image generation (i.e., orbital video generation). This methodology delves into the underlying temporal consistency knowledge in video diffusion model that generalizes well to geometry consistency across multiple views in 3D generation. Technically, Hi3D first empowers the pre-trained video diffusion model with 3D-aware prior (camera pose condition), yielding multi-view images with low-resolution texture details. A 3D-aware video-to-video refiner is learnt to further scale…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yanghb22-fdu/hi3d-official
pytorchOfficial

Models

🤗
hbyang/Hi3D
model· ♡ 5
♡ 5

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging

MethodsDiffusion