Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural   Human Rendering

Mingfei Chen; Jianfeng Zhang; Xiangyu Xu; Lijuan Liu; Yujun Cai,; Jiashi Feng; Shuicheng Yan

arXiv:2112.04312·cs.CV·March 31, 2022

Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering

Mingfei Chen, Jianfeng Zhang, Xiangyu Xu, Lijuan Liu, Yujun Cai,, Jiashi Feng, Shuicheng Yan

PDF

Open Access

TL;DR

This paper introduces GP-NeRF, a geometry-guided progressive NeRF approach that enhances generalizable and efficient neural human rendering, especially under sparse views and self-occlusion, with significant speed improvements.

Contribution

It proposes a novel geometry-guided multi-view feature integration and progressive rendering pipeline for improved human body synthesis.

Findings

01

Outperforms state-of-the-art methods on ZJU-MoCap and THUman datasets.

02

Reduces rendering time by over 70%.

03

Effectively handles self-occlusion and sparse view inputs.

Abstract

In this work we develop a generalizable and efficient Neural Radiance Field (NeRF) pipeline for high-fidelity free-viewpoint human body synthesis under settings with sparse camera views. Though existing NeRF-based methods can synthesize rather realistic details for human body, they tend to produce poor results when the input has self-occlusion, especially for unseen humans under sparse views. Moreover, these methods often require a large number of sampling points for rendering, which leads to low efficiency and limits their real-world applicability. To address these challenges, we propose a Geometry-guided Progressive NeRF (GP-NeRF). In particular, to better tackle self-occlusion, we devise a geometry-guided multi-view feature integration approach that utilizes the estimated geometry prior to integrate the incomplete information from input views and construct a complete geometry volume…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · 3D Shape Modeling and Analysis

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings