Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models
Zhibin Liu, Haoye Dong, Aviral Chharia, Hefeng Wu

TL;DR
Human-VDM introduces a novel approach to generate high-quality, view-consistent 3D human models from a single RGB image by leveraging video diffusion models and Gaussian Splatting, overcoming view inconsistency issues.
Contribution
It presents a three-module framework (view-consistent human video diffusion, video augmentation, and Gaussian Splatting) that produces lifelike 3D humans from a single image and addresses the view inconsistency of prior multi-view diffusion methods.
Findings
Outperforms state-of-the-art methods in 3D human generation, both qualitatively and quantitatively
Provides temporally consistent views for 3D human generation from a single image
Enhances textures and geometry via video augmentation techniques
Abstract
Generating lifelike 3D humans from a single RGB image remains a challenging task in computer vision, as it requires accurate modeling of geometry, high-quality texture, and plausible unseen parts. Existing methods typically use multi-view diffusion models for 3D generation, but they often face inconsistent view issues, which hinder high-quality 3D human generation. To address this, we propose Human-VDM, a novel method for generating 3D human from a single RGB image using Video Diffusion Models. Human-VDM provides temporally consistent views for 3D human generation using Gaussian Splatting. It consists of three modules: a view-consistent human video diffusion module, a video augmentation module, and a Gaussian Splatting module. First, a single image is fed into a human video diffusion module to generate a coherent human video. Next, the video augmentation module applies super-resolution…
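The three-module structure named in the abstract can be sketched as a simple staged pipeline. This is a toy illustration only, not the authors' implementation: every function and type name below is hypothetical, the "diffusion" stage just copies the input image, the "augmentation" stage uses nearest-neighbor upsampling as a crude stand-in for super-resolution, and the "splatting" stage creates one unoptimized Gaussian per pixel. The abstract is truncated before the later stages are described in detail, so the sketch shows only the data flow from image to video to Gaussians.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Toy stand-in for an image: rows of grayscale pixel values (assumption, not RGB).
Frame = List[List[float]]


@dataclass
class Gaussian:
    # Hypothetical minimal parameterization: a 3D center and a single color value.
    center: Tuple[float, float, float]
    color: float


def video_diffusion_stub(image: Frame, num_views: int) -> List[Frame]:
    """Placeholder for the human video diffusion module: one copy of the input per view."""
    return [[row[:] for row in image] for _ in range(num_views)]


def augment_stub(video: List[Frame], scale: int = 2) -> List[Frame]:
    """Placeholder for video augmentation: nearest-neighbor upsampling of every frame."""
    augmented = []
    for frame in video:
        upsampled: Frame = []
        for row in frame:
            # Repeat each pixel horizontally, then repeat the widened row vertically.
            wide = [pixel for pixel in row for _ in range(scale)]
            upsampled.extend(wide[:] for _ in range(scale))
        augmented.append(upsampled)
    return augmented


def splat_stub(video: List[Frame]) -> List[Gaussian]:
    """Placeholder for the Gaussian Splatting module: one Gaussian per pixel of the first frame."""
    first = video[0]
    return [
        Gaussian(center=(float(x), float(y), 0.0), color=pixel)
        for y, row in enumerate(first)
        for x, pixel in enumerate(row)
    ]


def run_pipeline(image: Frame, num_views: int = 4) -> List[Gaussian]:
    """Chain the three stages in the order the abstract lists them."""
    video = video_diffusion_stub(image, num_views)
    video = augment_stub(video)
    return splat_stub(video)


if __name__ == "__main__":
    toy_image = [[0.1, 0.2], [0.3, 0.4]]
    gaussians = run_pipeline(toy_image, num_views=3)
    print(len(gaussians), gaussians[0])
```

In a real system each stub would be replaced by a learned model or an optimization loop; the sketch only makes explicit that each stage consumes the previous stage's output.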
Taxonomy
Topics: Video Surveillance and Tracking Methods · Image and Video Quality Assessment · Optical Imaging and Spectroscopy Techniques
Methods: Diffusion
