Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion
Haoran Wei, Wencheng Han, Xingping Dong, Jianbing Shen

TL;DR
This paper introduces a novel diffusion-based method that leverages multi-view priors during conditioning and diffusion processes to generate high-fidelity, detailed 3D portraits from a single image, addressing previous issues of blurred textures.
Contribution
The paper proposes a Hybrid Priors Diffusion model and a Multi-View Noise Resampling strategy to improve cross-view consistency and detail richness in 3D portrait generation.
Findings
Produces 3D portraits with accurate geometry and rich details
Outperforms previous methods in view consistency and texture quality
Effective from a single input image
Abstract
Recent diffusion-based Single-image 3D portrait generation methods typically employ 2D diffusion models to provide multi-view knowledge, which is then distilled into 3D representations. However, these methods usually struggle to produce high-fidelity 3D models, frequently yielding excessively blurred textures. We attribute this issue to the insufficient consideration of cross-view consistency during the diffusion process, resulting in significant disparities between different views and ultimately leading to blurred 3D representations. In this paper, we address this issue by comprehensively exploiting multi-view priors in both the conditioning and diffusion procedures to produce consistent, detail-rich portraits. From the conditioning standpoint, we propose a Hybrid Priors Diffsion model, which explicitly and implicitly incorporates multi-view priors as conditions to enhance the status…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · 3D Shape Modeling and Analysis
MethodsDiffusion
