Neural Human Performer: Learning Generalizable Radiance Fields for Human Performance Rendering
Youngjoong Kwon, Dahun Kim, Duygu Ceylan, Henry Fuchs

TL;DR
This paper introduces Neural Human Performer, a method that learns generalizable neural radiance fields for synthesizing free-viewpoint videos of arbitrary human performances, effectively handling occlusions and articulations through temporal and multi-view transformers.
Contribution
It proposes a novel approach combining temporal and multi-view transformers with a parametric human body model for robust, generalizable human performance rendering.
Findings
Outperforms recent generalizable NeRF methods on unseen identities and poses.
Effectively handles occlusions and dynamic articulations.
Demonstrates superior performance on ZJU-MoCap and AIST datasets.
Abstract
In this paper, we aim at synthesizing a free-viewpoint video of an arbitrary human performance using sparse multi-view cameras. Recently, several works have addressed this problem by learning person-specific neural radiance fields (NeRF) to capture the appearance of a particular human. In parallel, some work proposed to use pixel-aligned features to generalize radiance fields to arbitrary new scenes and objects. Adopting such generalization approaches to humans, however, is highly challenging due to the heavy occlusions and dynamic articulations of body parts. To tackle this, we propose Neural Human Performer, a novel approach that learns generalizable neural radiance fields based on a parametric human body model for robust performance capture. Specifically, we first introduce a temporal transformer that aggregates tracked visual features based on the skeletal body motion over time.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Human Pose and Action Recognition
MethodsFast Attention Via Positive Orthogonal Random Features · Performer
