MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images
Shaofei Wang, Marko Mihajlovic, Qianli Ma, Andreas Geiger, Siyu Tang

TL;DR
This paper introduces MetaAvatar, a meta-learning approach that quickly generates realistic, controllable clothed human models from minimal monocular depth images, outperforming existing methods in speed and realism.
Contribution
We propose a meta-learned hypernetwork that predicts neural SDFs for clothed humans from depth images, enabling fast, realistic avatar generation with minimal input data.
Findings
Outperforms state-of-the-art methods requiring full meshes.
Generates realistic cloth deformations from as few as 8 depth frames.
Runs significantly faster than existing approaches.
Abstract
In this paper, we aim to create generalizable and controllable neural signed distance fields (SDFs) that represent clothed humans from monocular depth observations. Recent advances in deep learning, especially neural implicit representations, have enabled human shape reconstruction and controllable avatar generation from different sensor inputs. However, to generate realistic cloth deformations from novel input poses, watertight meshes or dense full-body scans are usually needed as inputs. Furthermore, due to the difficulty of effectively modeling pose-dependent cloth deformations for diverse body shapes and cloth types, existing approaches resort to per-subject/cloth-type optimization from scratch, which is computationally expensive. In contrast, we propose an approach that can quickly generate realistic clothed human avatars, represented as controllable neural SDFs, given only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Human Pose and Action Recognition
MethodsHyperNetwork
