EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior   for Sparse View

Zhaorong Wang; Yoshihiro Kanamori; Yuki Endo

arXiv:2410.12242·cs.CV·October 17, 2024

EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View

Zhaorong Wang, Yoshihiro Kanamori, Yuki Endo

PDF

Open Access

TL;DR

EG-HumanNeRF introduces a real-time, high-quality human NeRF framework that leverages human prior knowledge and efficient sampling strategies to improve rendering from sparse views, especially in occluded regions.

Contribution

The paper presents a novel human NeRF method that accelerates rendering using a two-stage sampling reduction and occlusion-aware attention, enhancing quality and speed with sparse inputs.

Findings

01

Outperforms state-of-the-art in rendering quality

02

Achieves real-time rendering speeds

03

Effectively handles occlusions with priors

Abstract

Generalizable neural radiance field (NeRF) enables neural-based digital human rendering without per-scene retraining. When combined with human prior knowledge, high-quality human rendering can be achieved even with sparse input views. However, the inference of these methods is still slow, as a large number of neural network queries on each ray are required to ensure the rendering quality. Moreover, occluded regions often suffer from artifacts, especially when the input views are sparse. To address these issues, we propose a generalizable human NeRF framework that achieves high-quality and real-time rendering with sparse input views by extensively leveraging human prior knowledge. We accelerate the rendering with a two-stage sampling reduction strategy: first constructing boundary meshes around the human geometry to reduce the number of ray samples for sampling guidance regression, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications

MethodsSoftmax · Attention Is All You Need · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings