D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from   Monocular Video

Moritz Kappel; Florian Hahlbohm; Timon Scholz; Susana Castillo,; Christian Theobalt; Martin Eisemann; Vladislav Golyanik; and Marcus Magnor

arXiv:2406.10078·cs.CV·March 3, 2025·1 cites

D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video

Moritz Kappel, Florian Hahlbohm, Timon Scholz, Susana Castillo,, Christian Theobalt, Martin Eisemann, Vladislav Golyanik, and Marcus Magnor

PDF

Open Access

TL;DR

This paper introduces D-NPC, a novel dynamic neural point cloud method for non-rigid view synthesis from monocular videos, enabling fast, high-quality rendering with explicit scene representation and data-driven priors.

Contribution

The paper presents a dynamic neural point cloud approach that efficiently reconstructs non-rigid scenes from monocular video, incorporating priors to improve speed and quality.

Findings

01

Achieves real-time rendering speeds suitable for interactive applications.

02

Provides competitive image quality on monocular benchmark sequences.

03

Enables efficient optimization through explicit scene initialization.

Abstract

Dynamic reconstruction and spatiotemporal novel-view synthesis of non-rigidly deforming scenes recently gained increased attention. While existing work achieves impressive quality and performance on multi-view or teleporting camera setups, most methods fail to efficiently and faithfully recover motion and appearance from casual monocular captures. This paper contributes to the field by introducing a new method for dynamic novel view synthesis from monocular video, such as casual smartphone captures. Our approach represents the scene as a $dynamic neural point cloud$ , an implicit time-conditioned point distribution that encodes local geometry and appearance in separate hash-encoded neural feature grids for static and dynamic regions. By sampling a discrete point cloud from our model, we can efficiently render high-quality novel views using a fast differentiable rasterizer and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · 3D Shape Modeling and Analysis

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings