GHuNeRF: Generalizable Human NeRF from a Monocular Video
Chen Li, Jiahao Lin, Gim Hee Lee

TL;DR
GHuNeRF introduces a novel method for learning a generalizable human NeRF model from monocular videos, utilizing a visibility-aware aggregation and attention-enhanced features to achieve competitive results without multi-view data.
Contribution
The paper proposes GHuNeRF, a new approach that enables human NeRF modeling from monocular videos, overcoming previous limitations of requiring multi-view data or poor generalization to unseen identities.
Findings
Achieves comparable performance to multi-view methods on ZJU-MoCap.
Outperforms existing monocular methods on People-Snapshot dataset.
Uses attention mechanisms to enhance feature volume accuracy.
Abstract
In this paper, we tackle the challenging task of learning a generalizable human NeRF model from a monocular video. Although existing generalizable human NeRFs have achieved impressive results, they require muti-view images or videos which might not be always available. On the other hand, some works on free-viewpoint rendering of human from monocular videos cannot be generalized to unseen identities. In view of these limitations, we propose GHuNeRF to learn a generalizable human NeRF model from a monocular video of the human performer. We first introduce a visibility-aware aggregation scheme to compute vertex-wise features, which is used to construct a 3D feature volume. The feature volume can only represent the overall geometry of the human performer with insufficient accuracy due to the limited resolution. To solve this, we further enhance the volume feature with temporally aligned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
MethodsFast Attention Via Positive Orthogonal Random Features · Performer
