Semantic-Human: Neural Rendering of Humans from Monocular Video with   Human Parsing

Jie Zhang; Pengcheng Shi; Zaiwang Gu; Yiyang Zhou; Zhi Wang

arXiv:2308.09894·cs.CV·August 22, 2023

Semantic-Human: Neural Rendering of Humans from Monocular Video with Human Parsing

Jie Zhang, Pengcheng Shi, Zaiwang Gu, Yiyang Zhou, Zhi Wang

PDF

Open Access

TL;DR

Semantic-Human is a novel neural rendering method that jointly encodes semantics, appearance, and geometry to produce photorealistic human images with consistent human parsing across views, even with noisy supervision.

Contribution

It extends neural radiance fields to incorporate semantic information and introduces constraints from SMPL for improved motion and geometry modeling, enabling view-consistent human parsing.

Findings

01

Achieves accurate 2D semantic labels with noisy pseudo-label supervision.

02

Demonstrates highly competitive results on ZJU-MoCap dataset.

03

Enables applications like label denoising, synthesis, and image editing.

Abstract

The neural rendering of humans is a topic of great research significance. However, previous works mostly focus on achieving photorealistic details, neglecting the exploration of human parsing. Additionally, classical semantic work are all limited in their ability to efficiently represent fine results in complex motions. Human parsing is inherently related to radiance reconstruction, as similar appearance and geometry often correspond to similar semantic part. Furthermore, previous works often design a motion field that maps from the observation space to the canonical space, while it tends to exhibit either underfitting or overfitting, resulting in limited generalization. In this paper, we present Semantic-Human, a novel method that achieves both photorealistic details and viewpoint-consistent human parsing for the neural rendering of humans. Specifically, we extend neural radiance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Advanced Neural Network Applications

MethodsFocus