GHuNeRF: Generalizable Human NeRF from a Monocular Video

Chen Li; Jiahao Lin; Gim Hee Lee

arXiv:2308.16576·cs.CV·December 13, 2023

GHuNeRF: Generalizable Human NeRF from a Monocular Video

Chen Li, Jiahao Lin, Gim Hee Lee

PDF

Open Access 1 Repo

TL;DR

GHuNeRF introduces a novel method for learning a generalizable human NeRF model from monocular videos, utilizing a visibility-aware aggregation and attention-enhanced features to achieve competitive results without multi-view data.

Contribution

The paper proposes GHuNeRF, a new approach that enables human NeRF modeling from monocular videos, overcoming previous limitations of requiring multi-view data or poor generalization to unseen identities.

Findings

01

Achieves comparable performance to multi-view methods on ZJU-MoCap.

02

Outperforms existing monocular methods on People-Snapshot dataset.

03

Uses attention mechanisms to enhance feature volume accuracy.

Abstract

In this paper, we tackle the challenging task of learning a generalizable human NeRF model from a monocular video. Although existing generalizable human NeRFs have achieved impressive results, they require muti-view images or videos which might not be always available. On the other hand, some works on free-viewpoint rendering of human from monocular videos cannot be generalized to unseen identities. In view of these limitations, we propose GHuNeRF to learn a generalizable human NeRF model from a monocular video of the human performer. We first introduce a visibility-aware aggregation scheme to compute vertex-wise features, which is used to construct a 3D feature volume. The feature volume can only represent the overall geometry of the human performer with insufficient accuracy due to the limited resolution. To solve this, we further enhance the volume feature with temporally aligned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chaneyddtt/ghunerf
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Neural Network Applications

MethodsFast Attention Via Positive Orthogonal Random Features · Performer