PIFu for the Real World: A Self-supervised Framework to Reconstruct   Dressed Human from Single-view Images

Zhangyang Xiong; Dong Du; Yushuang Wu; Jingqi Dong; Di Kang; Linchao; Bao; and Xiaoguang Han

arXiv:2208.10769·cs.CV·March 11, 2024

PIFu for the Real World: A Self-supervised Framework to Reconstruct Dressed Human from Single-view Images

Zhangyang Xiong, Dong Du, Yushuang Wu, Jingqi Dong, Di Kang, Linchao, Bao, and Xiaoguang Han

PDF

Open Access

TL;DR

This paper introduces SelfPIFu, a self-supervised framework that improves 3D human reconstruction from single images by leveraging depth information and in-the-wild data, reducing reliance on expensive ground truth annotations.

Contribution

The work presents a novel self-supervised learning approach for PIFu that effectively utilizes depth cues and in-the-wild images, enhancing generalization and reconstruction quality without needing 3D ground truth.

Findings

01

Achieves 93.5% IoU on synthetic data, outperforming PIFuHD by 18%.

02

User studies show over 68% preference for SelfPIFu reconstructions.

03

Effectively utilizes depth information to improve reconstruction accuracy.

Abstract

It is very challenging to accurately reconstruct sophisticated human geometry caused by various poses and garments from a single image. Recently, works based on pixel-aligned implicit function (PIFu) have made a big step and achieved state-of-the-art fidelity on image-based 3D human digitization. However, the training of PIFu relies heavily on expensive and limited 3D ground truth data (i.e. synthetic data), thus hindering its generalization to more diverse real world images. In this work, we propose an end-to-end self-supervised network named SelfPIFu to utilize abundant and diverse in-the-wild images, resulting in largely improved reconstructions when tested on unconstrained in-the-wild images. At the core of SelfPIFu is the depth-guided volume-/surface-aware signed distance fields (SDF) learning, which enables self-supervised learning of a PIFu without access to GT mesh. The whole…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Optical measurement and interference techniques