HuGDiffusion: Generalizable Single-Image Human Rendering via 3D Gaussian Diffusion

Yingzhi Tang, Qijian Zhang, and Junhui Hou

arXiv:2501.15008 · cs.CV · October 17, 2025

TL;DR

HuGDiffusion introduces a diffusion-based pipeline for single-image 3D human rendering that leverages human priors and a multi-stage generation process to improve novel view synthesis without requiring multi-view data.

Contribution

The paper presents a diffusion framework for single-image 3D human rendering that is conditioned on human priors extracted from the input image and combines a multi-stage attribute generation strategy with proxy supervision.

Findings

1. Outperforms state-of-the-art methods in 3D human rendering tasks.

2. Effectively generates 3D Gaussian splatting attributes from a single image.

3. Demonstrates robustness across diverse human poses and appearances.

Abstract

We present HuGDiffusion, a generalizable 3D Gaussian splatting (3DGS) learning pipeline to achieve novel view synthesis (NVS) of human characters from single-view input images. Existing approaches typically require monocular videos or calibrated multi-view images as inputs, which limits their applicability in real-world scenarios with arbitrary and/or unknown camera poses. In this paper, we aim to generate the set of 3DGS attributes via a diffusion-based framework conditioned on human priors extracted from a single image. Specifically, we begin with carefully integrated human-centric feature extraction procedures to deduce informative conditioning signals. Based on our empirical observation that jointly learning all 3DGS attributes is challenging to optimize, we design a multi-stage generation strategy to obtain different types of 3DGS attributes. To facilitate the training…
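
For readers unfamiliar with what "the set of 3DGS attributes" refers to, the following is a minimal sketch of one Gaussian primitive as it is commonly parameterized in 3D Gaussian splatting: a center, per-axis scales, a rotation quaternion, an opacity, and a color. It also shows how scale and rotation combine into a covariance matrix, Sigma = R S S^T R^T. The class and function names (`Gaussian3D`, `quat_to_rotation`, `covariance`) are illustrative assumptions, not identifiers from HuGDiffusion, and the excerpt above does not say which attributes are produced in which generation stage, so no such split is shown.

```python
from dataclasses import dataclass
from typing import List, Tuple

Vec3 = Tuple[float, float, float]
Quat = Tuple[float, float, float, float]  # (w, x, y, z)
Mat3 = List[List[float]]


@dataclass
class Gaussian3D:
    """One 3DGS primitive. Field names are hypothetical, chosen for clarity."""
    position: Vec3  # center (mean) of the Gaussian
    scale: Vec3     # per-axis standard deviations
    rotation: Quat  # orientation as a quaternion (w, x, y, z)
    opacity: float  # blending weight, expected in [0, 1]
    color: Vec3     # plain RGB here; 3DGS commonly stores SH coefficients


def quat_to_rotation(q: Quat) -> Mat3:
    """Convert a non-zero quaternion to a 3x3 rotation matrix."""
    w, x, y, z = q
    n = (w * w + x * x + y * y + z * z) ** 0.5  # normalize before use
    w, x, y, z = w / n, x / n, y / n, z / n
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - w * z), 2 * (x * z + w * y)],
        [2 * (x * y + w * z), 1 - 2 * (x * x + z * z), 2 * (y * z - w * x)],
        [2 * (x * z - w * y), 2 * (y * z + w * x), 1 - 2 * (x * x + y * y)],
    ]


def covariance(g: Gaussian3D) -> Mat3:
    """Build Sigma = R S S^T R^T, where S = diag(g.scale)."""
    r = quat_to_rotation(g.rotation)
    s2 = [s * s for s in g.scale]
    # Entry (i, j) is sum_k R[i][k] * s_k^2 * R[j][k].
    return [
        [sum(r[i][k] * s2[k] * r[j][k] for k in range(3)) for j in range(3)]
        for i in range(3)
    ]
```

With the identity quaternion `(1, 0, 0, 0)` the covariance is simply the diagonal matrix of squared scales; any other rotation reorients that ellipsoid without changing its axis lengths.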

Taxonomy

Topics: Advanced Vision and Imaging · Video Surveillance and Tracking Methods · Computer Graphics and Visualization Techniques

Methods: Sparse Evolutionary Training