Learning Efficient and Generalizable Human Representation with Human Gaussian Model

Yifan Liu; Shengjun Zhang; Chensheng Dai; Yang Chen; Hao Liu; Chen Li; Yueqi Duan

arXiv:2507.18758·cs.CV·July 28, 2025

Learning Efficient and Generalizable Human Representation with Human Gaussian Model

Yifan Liu, Shengjun Zhang, Chensheng Dai, Yang Chen, Hao Liu, Chen Li, Yueqi Duan

PDF

Open Access

TL;DR

This paper introduces the Human Gaussian Graph, a novel framework that models relationships between 3D Gaussians and human mesh vertices, enabling efficient and generalizable human avatar reconstruction from videos.

Contribution

The paper proposes the Human Gaussian Graph with intra- and inter-node operations to leverage information across frames for improved human avatar modeling.

Findings

01

Effective in novel view synthesis

02

Supports pose animation

03

Outperforms previous methods in efficiency and generalization

Abstract

Modeling animatable human avatars from videos is a long-standing and challenging problem. While conventional methods require per-instance optimization, recent feed-forward methods have been proposed to generate 3D Gaussians with a learnable network. However, these methods predict Gaussians for each frame independently, without fully capturing the relations of Gaussians from different timestamps. To address this, we propose Human Gaussian Graph to model the connection between predicted Gaussians and human SMPL mesh, so that we can leverage information from all frames to recover an animatable human representation. Specifically, the Human Gaussian Graph contains dual layers where Gaussians are the first layer nodes and mesh vertices serve as the second layer nodes. Based on this structure, we further propose the intra-node operation to aggregate various Gaussians connected to one mesh…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Shape Modeling and Analysis