High-Resolution Volumetric Reconstruction for Clothed Humans
Sicong Tang, Guangyuan Wang, Qing Ran, Lingzhi Li, Li Shen, Ping, Tan

TL;DR
This paper introduces a volumetric reconstruction method for clothed humans from few images, leveraging 3D convolutions and a coarse-to-fine strategy to achieve high accuracy and realistic rendering.
Contribution
It demonstrates that a well-designed volumetric approach with system optimizations outperforms recent implicit methods in clothed human reconstruction.
Findings
Reduces point-to-surface error by over 50% to about 2mm.
Achieves higher PSNR in rendered images compared to state-of-the-art.
Uses a coarse-to-fine strategy with voxel culling for efficiency.
Abstract
We present a novel method for reconstructing clothed humans from a sparse set of, e.g., 1 to 6 RGB images. Despite impressive results from recent works employing deep implicit representation, we revisit the volumetric approach and demonstrate that better performance can be achieved with proper system design. The volumetric representation offers significant advantages in leveraging 3D spatial context through 3D convolutions, and the notorious quantization error is largely negligible with a reasonably large yet affordable volume resolution, e.g., 512. To handle memory and computation costs, we propose a sophisticated coarse-to-fine strategy with voxel culling and subspace sparse convolution. Our method starts with a discretized visual hull to compute a coarse shape and then focuses on a narrow band nearby the coarse shape for refinement. Once the shape is reconstructed, we adopt an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Color Science and Applications
