Towards High-Fidelity 3D Portrait Generation with Rich Details by   Cross-View Prior-Aware Diffusion

Haoran Wei; Wencheng Han; Xingping Dong; Jianbing Shen

arXiv:2411.10369·cs.CV·November 18, 2024

Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion

Haoran Wei, Wencheng Han, Xingping Dong, Jianbing Shen

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel diffusion-based method that leverages multi-view priors during conditioning and diffusion processes to generate high-fidelity, detailed 3D portraits from a single image, addressing previous issues of blurred textures.

Contribution

The paper proposes a Hybrid Priors Diffusion model and a Multi-View Noise Resampling strategy to improve cross-view consistency and detail richness in 3D portrait generation.

Findings

01

Produces 3D portraits with accurate geometry and rich details

02

Outperforms previous methods in view consistency and texture quality

03

Effective from a single input image

Abstract

Recent diffusion-based Single-image 3D portrait generation methods typically employ 2D diffusion models to provide multi-view knowledge, which is then distilled into 3D representations. However, these methods usually struggle to produce high-fidelity 3D models, frequently yielding excessively blurred textures. We attribute this issue to the insufficient consideration of cross-view consistency during the diffusion process, resulting in significant disparities between different views and ultimately leading to blurred 3D representations. In this paper, we address this issue by comprehensively exploiting multi-view priors in both the conditioning and diffusion procedures to produce consistent, detail-rich portraits. From the conditioning standpoint, we propose a Hybrid Priors Diffsion model, which explicitly and implicitly incorporates multi-view priors as conditions to enhance the status…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion· underline

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · 3D Shape Modeling and Analysis

MethodsDiffusion