NeRFPlayer: A Streamable Dynamic Scene Representation with Decomposed   Neural Radiance Fields

Liangchen Song; Anpei Chen; Zhong Li; Zhang Chen; Lele Chen; Junsong; Yuan; Yi Xu; Andreas Geiger

arXiv:2210.15947·cs.CV·February 21, 2023·6 cites

NeRFPlayer: A Streamable Dynamic Scene Representation with Decomposed Neural Radiance Fields

Liangchen Song, Anpei Chen, Zhong Li, Zhang Chen, Lele Chen, Junsong, Yuan, Yi Xu, Andreas Geiger

PDF

Open Access

TL;DR

NeRFPlayer introduces a fast, streamable neural radiance field framework for dynamic scene reconstruction and rendering from minimal camera input, enabling real-time exploration in VR.

Contribution

It decomposes 4D space into static, deforming, and new areas with separate neural fields and employs a hybrid feature streaming scheme for efficient modeling.

Findings

01

Achieves reconstruction in 10 seconds per frame

02

Provides interactive rendering performance

03

Works with single or multiple camera setups

Abstract

Visually exploring in a real-world 4D spatiotemporal space freely in VR has been a long-term quest. The task is especially appealing when only a few or even single RGB cameras are used for capturing the dynamic scene. To this end, we present an efficient framework capable of fast reconstruction, compact modeling, and streamable rendering. First, we propose to decompose the 4D spatiotemporal space according to temporal characteristics. Points in the 4D space are associated with probabilities of belonging to three categories: static, deforming, and new areas. Each area is represented and regularized by a separate neural field. Second, we propose a hybrid representations based feature streaming scheme for efficiently modeling the neural fields. Our approach, coined NeRFPlayer, is evaluated on dynamic scenes captured by single hand-held cameras and multi-camera arrays, achieving comparable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Human Pose and Action Recognition

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings