Semantic-Human: Neural Rendering of Humans from Monocular Video with Human Parsing
Jie Zhang, Pengcheng Shi, Zaiwang Gu, Yiyang Zhou, Zhi Wang

TL;DR
Semantic-Human is a novel neural rendering method that jointly encodes semantics, appearance, and geometry to produce photorealistic human images with consistent human parsing across views, even with noisy supervision.
Contribution
It extends neural radiance fields to incorporate semantic information and introduces constraints from SMPL for improved motion and geometry modeling, enabling view-consistent human parsing.
Findings
Achieves accurate 2D semantic labels with noisy pseudo-label supervision.
Demonstrates highly competitive results on ZJU-MoCap dataset.
Enables applications like label denoising, synthesis, and image editing.
Abstract
The neural rendering of humans is a topic of great research significance. However, previous works mostly focus on achieving photorealistic details, neglecting the exploration of human parsing. Additionally, classical semantic work are all limited in their ability to efficiently represent fine results in complex motions. Human parsing is inherently related to radiance reconstruction, as similar appearance and geometry often correspond to similar semantic part. Furthermore, previous works often design a motion field that maps from the observation space to the canonical space, while it tends to exhibit either underfitting or overfitting, resulting in limited generalization. In this paper, we present Semantic-Human, a novel method that achieves both photorealistic details and viewpoint-consistent human parsing for the neural rendering of humans. Specifically, we extend neural radiance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Advanced Neural Network Applications
MethodsFocus
