HumanSplatHMR: Closing the Loop Between Human Mesh Recovery and Gaussian Splatting Avatar

Yeheng Zong; Pou-Chun Kung; Yike Pan; Seth Isaacson; Yizhou Chen; Ram Vasudevan; Katherine A. Skinner

arXiv:2605.02784·cs.CV·May 22, 2026

HumanSplatHMR: Closing the Loop Between Human Mesh Recovery and Gaussian Splatting Avatar

Yeheng Zong, Pou-Chun Kung, Yike Pan, Seth Isaacson, Yizhou Chen, Ram Vasudevan, Katherine A. Skinner

PDF

TL;DR

HumanSplatHMR introduces a joint optimization framework that refines 3D human pose and learns high-fidelity avatars from video, improving accuracy and rendering quality without relying on motion capture.

Contribution

It closes the loop between pose estimation and differentiable rendering, enabling better in-the-wild human avatar reconstruction from only mesh estimates.

Findings

01

Improves 3D pose accuracy over baseline methods

02

Enhances avatar rendering from novel views and poses

03

Outperforms existing pose and avatar reconstruction methods

Abstract

Accurately recovering human pose and appearance from video is an essential component of scene reconstruction, with applications to motion capture, motion prediction, virtual reality, and digital twinning. Despite significant interest in building realistic human avatars from video, this paper demonstrates that existing methods do not accurately recover the 3D geometry of humans. ViT-based approaches are not consistently reliable and can overfit to 2D views, while NeRF- and Gaussian Splatting-based avatars treat pose and appearance separately, limiting rendering generalization to new poses. To resolve these shortcomings, this paper proposes HumanSplatHMR, a joint optimization framework that refines 3D human poses while simultaneously learning a high-fidelity avatar for novel-view and novel-pose synthesis. Our key insight is to close the loop between geometric pose estimation and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.