Efficient Neural Implicit Representation for 3D Human Reconstruction

Zexu Huang; Sarah Monazam Erfani; Siying Lu; Mingming Gong

arXiv:2410.17741·cs.CV·October 24, 2024

Efficient Neural Implicit Representation for 3D Human Reconstruction

Zexu Huang, Sarah Monazam Erfani, Siying Lu, Mingming Gong

PDF

1 Repo

TL;DR

HumanAvatar is a novel method that efficiently reconstructs high-quality 3D human avatars from monocular videos by integrating advanced models and techniques, achieving faster training and superior accuracy compared to existing methods.

Contribution

The paper introduces HumanAvatar, combining HuMoR, Instant-NGP, and Fast-SNARF, with a posture-sensitive space reduction to significantly improve speed and quality of 3D human reconstruction from monocular videos.

Findings

01

Achieves 110X faster training than state-of-the-art NeRF models.

02

Provides comparable or better quality reconstructions in minutes.

03

Effective visual results after only 30 seconds of training.

Abstract

High-fidelity digital human representations are increasingly in demand in the digital world, particularly for interactive telepresence, AR/VR, 3D graphics, and the rapidly evolving metaverse. Even though they work well in small spaces, conventional methods for reconstructing 3D human motion frequently require the use of expensive hardware and have high processing costs. This study presents HumanAvatar, an innovative approach that efficiently reconstructs precise human avatars from monocular video sources. At the core of our methodology, we integrate the pre-trained HuMoR, a model celebrated for its proficiency in human motion estimation. This is adeptly fused with the cutting-edge neural radiance field technology, Instant-NGP, and the state-of-the-art articulated model, Fast-SNARF, to enhance the reconstruction fidelity and speed. By combining these two technologies, a system is created…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HZXu-526/Human-Avatar
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings