Topology-aware Human Avatars with Semantically-guided Gaussian Splatting

Haoyu Zhao; Chen Yang; Hao Wang; Xingyue Zhao; Wei Shen

arXiv:2408.09665·cs.CV·November 20, 2024

Topology-aware Human Avatars with Semantically-guided Gaussian Splatting

Haoyu Zhao, Chen Yang, Hao Wang, Xingyue Zhao, Wei Shen

PDF

Open Access

TL;DR

This paper introduces SG-GS, a novel method for reconstructing detailed, topology-aware human avatars from monocular videos by embedding semantic information into 3D Gaussians and employing skeleton-driven deformations.

Contribution

The paper proposes a semantics-embedded 3D Gaussian representation and a semantic-guided optimization framework for high-fidelity, topology-aware human avatar reconstruction.

Findings

01

Achieves state-of-the-art geometry reconstruction

02

Enhances semantic accuracy in Gaussian representations

03

Improves rendering quality of human avatars

Abstract

Reconstructing photo-realistic and topology-aware animatable human avatars from monocular videos remains challenging in computer vision and graphics. Recently, methods using 3D Gaussians to represent the human body have emerged, offering faster optimization and real-time rendering. However, due to ignoring the crucial role of human body semantic information which represents the explicit topological and intrinsic structure within human body, they fail to achieve fine-detail reconstruction of human avatars. To address this issue, we propose SG-GS, which uses semantics-embedded 3D Gaussians, skeleton-driven rigid deformation, and non-rigid cloth dynamics deformation to create photo-realistic human avatars. We then design a Semantic Human-Body Annotator (SHA) which utilizes SMPL's semantic prior for efficient body part semantic labeling. The generated labels are used to guide the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Human Motion and Animation · Advanced Vision and Imaging