CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple   Images

Jisu Shin; Junmyeong Lee; Seongmin Lee; Min-Gyu Park; Ju-Mi Kang; Ju; Hong Yoon; Hae-Gon Jeon

arXiv:2407.04345·cs.CV·July 16, 2024

CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images

Jisu Shin, Junmyeong Lee, Seongmin Lee, Min-Gyu Park, Ju-Mi Kang, Ju, Hong Yoon, Hae-Gon Jeon

PDF

Open Access 1 Repo

TL;DR

CanonicalFusion is a new framework that reconstructs animatable 3D human avatars from multiple images by integrating individual reconstructions into a canonical space, using a novel skinning and differentiable rendering approach.

Contribution

It introduces a method to predict compressed skinning weights and employs a forward skinning-based differentiable rendering scheme for improved 3D human avatar reconstruction.

Findings

01

Effective reconstruction of drivable 3D human avatars from multiple images.

02

Outperforms state-of-the-art methods in accuracy and robustness.

03

Open-source implementation available for reproducibility.

Abstract

We present a novel framework for reconstructing animatable human avatars from multiple images, termed CanonicalFusion. Our central concept involves integrating individual reconstruction results into the canonical space. To be specific, we first predict Linear Blend Skinning (LBS) weight maps and depth maps using a shared-encoder-dual-decoder network, enabling direct canonicalization of the 3D mesh from the predicted depth maps. Here, instead of predicting high-dimensional skinning weights, we infer compressed skinning weights, i.e., 3-dimensional vector, with the aid of pre-trained MLP networks. We also introduce a forward skinning-based differentiable rendering scheme to merge the reconstructed results from multiple images. This scheme refines the initial mesh by reposing the canonical mesh via the forward skinning and by minimizing photometric and geometric errors between the rendered…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jsshin98/canonicalfusion
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Shape Modeling and Analysis