DeepHuman: 3D Human Reconstruction from a Single Image

Zerong Zheng; Tao Yu; Yixuan Wei; Qionghai Dai; Yebin Liu

arXiv:1903.06473 · cs.CV · March 29, 2019 · 20 cites

TL;DR

DeepHuman introduces a novel CNN framework for 3D human reconstruction from a single RGB image, utilizing semantic guidance and multi-scale feature fusion to improve surface detail accuracy.

Contribution

The paper presents a network architecture that combines volumetric feature transformation with a dense semantic input generated from the SMPL model, and contributes THuman, a real-world 3D human model dataset of about 7000 models, for training.

Findings

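01. Outperforms existing methods in 3D human reconstruction accuracy

02. Reconstructs invisible surface areas with the help of dense semantic guidance derived from the SMPL model

03. Refines visible surface details with a normal refinement network, connected to the volume generation network through a volumetric normal projection layer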

Abstract

We propose DeepHuman, an image-guided volume-to-volume translation CNN for 3D human reconstruction from a single RGB image. To reduce the ambiguities associated with the surface geometry reconstruction, even for the reconstruction of invisible areas, we propose and leverage a dense semantic representation generated from SMPL model as an additional input. One key feature of our network is that it fuses different scales of image features into the 3D space through volumetric feature transformation, which helps to recover accurate surface geometry. The visible surface details are further refined through a normal refinement network, which can be concatenated with the volume generation network using our proposed volumetric normal projection layer. We also contribute THuman, a 3D real-world human model dataset containing about 7000 models. The network is trained using training data generated…
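The abstract does not spell out how volumetric feature transformation works. As a rough illustration only, the sketch below assumes one plausible reading: 2D maps derived from image features act as a per-pixel scale and shift that is applied identically to every depth slice of a 3D feature volume. The function name, argument names, tensor layout, and the affine form are all assumptions made for this example and are not taken from the paper or the official repository.

```python
import numpy as np


def volumetric_feature_transform(volume, scale_map, shift_map):
    """Hypothetical sketch: modulate a 3D feature volume with 2D image-derived maps.

    volume:    array of shape (D, H, W, C), features on a voxel grid
    scale_map: array of shape (H, W, C), per-pixel scale (assumed to come from image features)
    shift_map: array of shape (H, W, C), per-pixel shift (assumed to come from image features)
    """
    if scale_map.shape != volume.shape[1:] or shift_map.shape != volume.shape[1:]:
        raise ValueError("scale_map and shift_map must match the volume's (H, W, C) shape")
    # Adding a leading axis broadcasts the same 2D maps across every depth slice.
    return volume * scale_map[None] + shift_map[None]


rng = np.random.default_rng(0)
volume = rng.standard_normal((8, 16, 16, 4))    # toy feature volume
scale_map = rng.standard_normal((16, 16, 4))    # stand-in for image-derived scale
shift_map = rng.standard_normal((16, 16, 4))    # stand-in for image-derived shift
out = volumetric_feature_transform(volume, scale_map, shift_map)
```

Under this assumption, fusing several image scales would amount to repeating the same operation at several volume resolutions, each with maps of matching height and width.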

Code & Models

Repositories

ZhengZerong/DeepHuman (TensorFlow, official)

Taxonomy

Topics: Advanced Vision and Imaging · 3D Shape Modeling and Analysis · Human Pose and Action Recognition