HumanGif: Single-View Human Diffusion with Generative Prior
Shoukang Hu, Takuya Narihira, Kazumi Fukuda, Ryosuke Sawata, Takashi Shibuya, Yuki Mitsufuji

TL;DR
HumanGif is a novel single-view human diffusion model that synthesizes view-consistent, temporally coherent 3D human avatars by leveraging generative priors and a Human NeRF module, outperforming previous methods.
Contribution
The paper introduces HumanGif, a new single-view 3D human synthesis approach combining diffusion models with a Human NeRF module for improved view and pose consistency.
Findings
Achieves superior perceptual quality in novel view and pose synthesis.
Demonstrates strong generalization across multiple datasets.
Outperforms existing methods in view consistency and temporal coherence.
Abstract
Previous 3D human creation methods have made significant progress in synthesizing view-consistent and temporally aligned results from sparse-view images or monocular videos. However, it remains challenging to produce perpetually realistic, view-consistent, and temporally coherent human avatars from a single image, as limited information is available in the single-view input setting. Motivated by the success of 2D character animation, we propose HumanGif, a single-view human diffusion model with generative prior. Specifically, we formulate the single-view-based 3D human novel view and pose synthesis as a single-view-conditioned human diffusion process, utilizing generative priors from foundational diffusion models to complement the missing information. To ensure fine-grained and consistent novel view and pose synthesis, we introduce a Human NeRF module in HumanGif to learn spatially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis
MethodsDiffusion
