HumanNorm: Learning Normal Diffusion Model for High-quality and   Realistic 3D Human Generation

Xin Huang; Ruizhi Shao; Qi Zhang; Hongwen Zhang; Ying Feng; Yebin Liu,; Qing Wang

arXiv:2310.01406·cs.CV·November 30, 2023·5 cites

HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation

Xin Huang, Ruizhi Shao, Qi Zhang, Hongwen Zhang, Ying Feng, Yebin Liu,, Qing Wang

PDF

Open Access

TL;DR

HumanNorm introduces a novel diffusion-based approach that significantly improves the quality and realism of 3D human generation by learning normal and color alignment models, surpassing existing methods.

Contribution

The paper proposes a new normal-adapted and normal-aligned diffusion model to enhance 3D human generation quality and realism, addressing limitations of previous text-to-3D methods.

Findings

01

Outperforms existing text-to-3D methods in geometry quality

02

Produces more realistic textures and detailed geometry

03

Demonstrates effective progressive geometry generation

Abstract

Recent text-to-3D methods employing diffusion models have made significant advancements in 3D human generation. However, these approaches face challenges due to the limitations of text-to-image diffusion models, which lack an understanding of 3D structures. Consequently, these methods struggle to achieve high-quality human generation, resulting in smooth geometry and cartoon-like appearances. In this paper, we propose HumanNorm, a novel approach for high-quality and realistic 3D human generation. The main idea is to enhance the model's 2D perception of 3D geometry by learning a normal-adapted diffusion model and a normal-aligned diffusion model. The normal-adapted diffusion model can generate high-fidelity normal maps corresponding to user prompts with view-dependent and body-aware text. The normal-aligned diffusion model learns to generate color images aligned with the normal maps,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · 3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis

MethodsDiffusion