Parametric Gaussian Human Model: Generalizable Prior for Efficient and Realistic Human Avatar Modeling

Cheng Peng; Jingxiang Sun; Yushuo Chen; Zhaoqi Su; Zhuo Su; Yebin Liu

arXiv:2506.06645·cs.CV·June 10, 2025

Parametric Gaussian Human Model: Generalizable Prior for Efficient and Realistic Human Avatar Modeling

Cheng Peng, Jingxiang Sun, Yushuo Chen, Zhaoqi Su, Zhuo Su, Yebin Liu

PDF

Open Access

TL;DR

The paper introduces PGHM, a novel framework that leverages human priors for fast, high-fidelity 3D human avatar reconstruction from monocular videos, addressing efficiency and generalization issues in existing methods.

Contribution

PGHM integrates a UV-aligned latent identity map and a Multi-Head U-Net to enable rapid, high-quality avatar creation with minimal optimization time from monocular inputs.

Findings

01

Achieves high-quality avatars in about 20 minutes per subject.

02

Outperforms existing methods in efficiency while maintaining visual fidelity.

03

Demonstrates robustness under challenging poses and viewpoints.

Abstract

Photorealistic and animatable human avatars are a key enabler for virtual/augmented reality, telepresence, and digital entertainment. While recent advances in 3D Gaussian Splatting (3DGS) have greatly improved rendering quality and efficiency, existing methods still face fundamental challenges, including time-consuming per-subject optimization and poor generalization under sparse monocular inputs. In this work, we present the Parametric Gaussian Human Model (PGHM), a generalizable and efficient framework that integrates human priors into 3DGS for fast and high-fidelity avatar reconstruction from monocular videos. PGHM introduces two core components: (1) a UV-aligned latent identity map that compactly encodes subject-specific geometry and appearance into a learnable feature tensor; and (2) a disentangled Multi-Head U-Net that predicts Gaussian attributes by decomposing static,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Human Pose and Action Recognition