HumanGif: Single-View Human Diffusion with Generative Prior

Shoukang Hu; Takuya Narihira; Kazumi Fukuda; Ryosuke Sawata; Takashi Shibuya; Yuki Mitsufuji

arXiv:2502.12080·cs.CV·July 1, 2025

HumanGif: Single-View Human Diffusion with Generative Prior

Shoukang Hu, Takuya Narihira, Kazumi Fukuda, Ryosuke Sawata, Takashi Shibuya, Yuki Mitsufuji

PDF

Open Access 1 Repo 1 Models

TL;DR

HumanGif is a novel single-view human diffusion model that synthesizes view-consistent, temporally coherent 3D human avatars by leveraging generative priors and a Human NeRF module, outperforming previous methods.

Contribution

The paper introduces HumanGif, a new single-view 3D human synthesis approach combining diffusion models with a Human NeRF module for improved view and pose consistency.

Findings

01

Achieves superior perceptual quality in novel view and pose synthesis.

02

Demonstrates strong generalization across multiple datasets.

03

Outperforms existing methods in view consistency and temporal coherence.

Abstract

Previous 3D human creation methods have made significant progress in synthesizing view-consistent and temporally aligned results from sparse-view images or monocular videos. However, it remains challenging to produce perpetually realistic, view-consistent, and temporally coherent human avatars from a single image, as limited information is available in the single-view input setting. Motivated by the success of 2D character animation, we propose HumanGif, a single-view human diffusion model with generative prior. Specifically, we formulate the single-view-based 3D human novel view and pose synthesis as a single-view-conditioned human diffusion process, utilizing generative priors from foundational diffusion models to complement the missing information. To ensure fine-grained and consistent novel view and pose synthesis, we introduce a Human NeRF module in HumanGif to learn spatially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

skhu101/humangif
pytorchOfficial

Models

🤗
Sony/humangif
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis

MethodsDiffusion