Topology-aware Human Avatars with Semantically-guided Gaussian Splatting
Haoyu Zhao, Chen Yang, Hao Wang, Xingyue Zhao, Wei Shen

TL;DR
This paper introduces SG-GS, a novel method for reconstructing detailed, topology-aware human avatars from monocular videos by embedding semantic information into 3D Gaussians and employing skeleton-driven deformations.
Contribution
The paper proposes a semantics-embedded 3D Gaussian representation and a semantic-guided optimization framework for high-fidelity, topology-aware human avatar reconstruction.
Findings
Achieves state-of-the-art geometry reconstruction
Enhances semantic accuracy in Gaussian representations
Improves rendering quality of human avatars
Abstract
Reconstructing photo-realistic and topology-aware animatable human avatars from monocular videos remains challenging in computer vision and graphics. Recently, methods using 3D Gaussians to represent the human body have emerged, offering faster optimization and real-time rendering. However, due to ignoring the crucial role of human body semantic information which represents the explicit topological and intrinsic structure within human body, they fail to achieve fine-detail reconstruction of human avatars. To address this issue, we propose SG-GS, which uses semantics-embedded 3D Gaussians, skeleton-driven rigid deformation, and non-rigid cloth dynamics deformation to create photo-realistic human avatars. We then design a Semantic Human-Body Annotator (SHA) which utilizes SMPL's semantic prior for efficient body part semantic labeling. The generated labels are used to guide the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Advanced Vision and Imaging
