Guide3D: Create 3D Avatars from Text and Image Guidance

Yukang Cao; Yan-Pei Cao; Kai Han; Ying Shan; Kwan-Yee K. Wong

arXiv:2308.09705·cs.CV·August 21, 2023·1 cites

Guide3D: Create 3D Avatars from Text and Image Guidance

Yukang Cao, Yan-Pei Cao, Kai Han, Ying Shan, Kwan-Yee K. Wong

PDF

Open Access

TL;DR

Guide3D is a novel zero-shot 3D avatar generation model that leverages diffusion models and multi-view image features to produce high-quality textured 3D meshes from text and images.

Contribution

It introduces a new framework combining diffusion models with differentiable grids and feature fusion for high-resolution 3D avatar synthesis from text and images.

Findings

01

Outperforms state-of-the-art in geometry and texture quality

02

Effectively transfers 2D image features to 3D models

03

Enhances 3D generation with novel training objectives

Abstract

Recently, text-to-image generation has exhibited remarkable advancements, with the ability to produce visually impressive results. In contrast, text-to-3D generation has not yet reached a comparable level of quality. Existing methods primarily rely on text-guided score distillation sampling (SDS), and they encounter difficulties in transferring 2D attributes of the generated images to 3D content. In this work, we aim to develop an effective 3D generative model capable of synthesizing high-resolution textured meshes by leveraging both textual and image information. To this end, we introduce Guide3D, a zero-shot text-and-image-guided generative model for 3D avatar generation based on diffusion models. Our model involves (1) generating sparse-view images of a text-consistent character using diffusion models, and (2) jointly optimizing multi-resolution differentiable marching tetrahedral…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Advanced Vision and Imaging

MethodsDiffusion