HuGDiffusion: Generalizable Single-Image Human Rendering via 3D Gaussian Diffusion
Yingzhi Tang, Qijian Zhang, and Junhui Hou

TL;DR
HuGDiffusion introduces a diffusion-based pipeline for single-image 3D human rendering that leverages human priors and a multi-stage generation process to improve novel view synthesis without requiring multi-view data.
Contribution
It presents a novel diffusion framework conditioned on human priors for single-image 3D human rendering, with a multi-stage attribute generation strategy and proxy supervision.
Findings
Outperforms state-of-the-art methods in 3D human rendering tasks.
Effectively generates 3D Gaussian splatting attributes from a single image.
Demonstrates robustness across diverse human poses and appearances.
Abstract
We present HuGDiffusion, a generalizable 3D Gaussian splatting (3DGS) learning pipeline to achieve novel view synthesis (NVS) of human characters from single-view input images. Existing approaches typically require monocular videos or calibrated multi-view images as inputs, whose applicability could be weakened in real-world scenarios with arbitrary and/or unknown camera poses. In this paper, we aim to generate the set of 3DGS attributes via a diffusion-based framework conditioned on human priors extracted from a single image. Specifically, we begin with carefully integrated human-centric feature extraction procedures to deduce informative conditioning signals. Based on our empirical observations that jointly learning the whole 3DGS attributes is challenging to optimize, we design a multi-stage generation strategy to obtain different types of 3DGS attributes. To facilitate the training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Video Surveillance and Tracking Methods · Computer Graphics and Visualization Techniques
MethodsSparse Evolutionary Training
