DressRecon: Freeform 4D Human Reconstruction from Monocular Video

Jeff Tan; Donglai Xiang; Shubham Tulsiani; Deva Ramanan; Gengshan Yang

arXiv:2409.20563·cs.CV·October 10, 2024

DressRecon: Freeform 4D Human Reconstruction from Monocular Video

Jeff Tan, Donglai Xiang, Shubham Tulsiani, Deva Ramanan, Gengshan Yang

PDF

Open Access

TL;DR

DressRecon introduces a novel method for reconstructing detailed, time-consistent 3D human models from monocular videos, effectively handling loose clothing and object interactions without requiring multi-view setups or personalized templates.

Contribution

It combines generic human priors with video-specific deformation models using neural implicit representations for flexible, high-quality 3D human reconstruction from monocular videos.

Findings

01

Outperforms prior methods on challenging clothing and object interaction datasets.

02

Produces high-fidelity, time-consistent 3D meshes suitable for rendering.

03

Leverages image-based priors for capturing subtle clothing geometry.

Abstract

We present a method to reconstruct time-consistent human body models from monocular videos, focusing on extremely loose clothing or handheld object interactions. Prior work in human reconstruction is either limited to tight clothing with no object interactions, or requires calibrated multi-view captures or personalized template scans which are costly to collect at scale. Our key insight for high-quality yet flexible reconstruction is the careful combination of generic human priors about articulated body shape (learned from large-scale training data) with video-specific articulated "bag-of-bones" deformation (fit to a single video via test-time optimization). We accomplish this by learning a neural implicit model that disentangles body versus clothing deformations as separate motion model layers. To capture subtle geometry of clothing, we leverage image-based priors such as human body…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis · Anatomy and Medical Technology