Dynamic Scene Understanding through Object-Centric Voxelization and   Neural Rendering

Yanpeng Zhao; Yiwei Hao; Siyu Gao; Yunbo Wang; Xiaokang Yang

arXiv:2407.20908·cs.CV·February 17, 2025

Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering

Yanpeng Zhao, Yiwei Hao, Siyu Gao, Yunbo Wang, Xiaokang Yang

PDF

Open Access 1 Repo

TL;DR

DynaVol-S is a novel 3D generative model that captures dynamic scenes with object-centric voxelization and neural rendering, enabling unsupervised scene understanding, semantic segmentation, and scene editing.

Contribution

It introduces a 3D object-centric voxelization framework with semantic integration within a differentiable volume rendering pipeline, advancing dynamic scene modeling and unsupervised decomposition.

Findings

01

Outperforms existing models in novel view synthesis

02

Enables unsupervised scene decomposition

03

Supports scene editing and manipulation

Abstract

Learning object-centric representations from unsupervised videos is challenging. Unlike most previous approaches that focus on decomposing 2D images, we present a 3D generative model named DynaVol-S for dynamic scenes that enables object-centric learning within a differentiable volume rendering framework. The key idea is to perform object-centric voxelization to capture the 3D nature of the scene, which infers per-object occupancy probabilities at individual spatial locations. These voxel features evolve through a canonical-space deformation function and are optimized in an inverse rendering pipeline with a compositional NeRF. Additionally, our approach integrates 2D semantic features to create 3D semantic grids, representing the scene through multiple disentangled voxel grids. DynaVol-S significantly outperforms existing models in both novel view synthesis and unsupervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zyp123494/dynavol
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing and 3D Reconstruction

MethodsFocus