NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos

Jinxi Li; Ziyang Song; Bo Yang

arXiv:2312.06398·cs.CV·December 12, 2023·1 cites

NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos

Jinxi Li, Ziyang Song, Bo Yang

PDF

Open Access 1 Repo 1 Video

TL;DR

NVFi introduces a novel approach to model 3D scene dynamics from multi-view videos, enabling applications like future frame prediction and 3D scene understanding without supervision.

Contribution

The paper presents NVFi, a method that jointly learns geometry, appearance, and velocity of 3D scenes from videos, with new datasets and superior performance in dynamic scene tasks.

Findings

01

Outperforms baseline methods in future frame extrapolation

02

Enables unsupervised 3D semantic scene decomposition

03

Effective in modeling complex 3D scene dynamics

Abstract

In this paper, we aim to model 3D scene dynamics from multi-view videos. Unlike the majority of existing works which usually focus on the common task of novel view synthesis within the training time period, we propose to simultaneously learn the geometry, appearance, and physical velocity of 3D scenes only from video frames, such that multiple desirable applications can be supported, including future frame extrapolation, unsupervised 3D semantic scene decomposition, and dynamic motion transfer. Our method consists of three major components, 1) the keyframe dynamic radiance field, 2) the interframe velocity field, and 3) a joint keyframe and interframe optimization module which is the core of our framework to effectively train both networks. To validate our method, we further introduce two dynamic 3D datasets: 1) Dynamic Object dataset, and 2) Dynamic Indoor Scene dataset. We conduct…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vlar-group/nvfi
pytorchOfficial

Videos

NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos· slideslive

Taxonomy

TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Advanced Image Processing Techniques

MethodsFocus