DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images
Bing Wang, Lu Chen, Bo Yang

TL;DR
DM-NeRF is a novel method that enables 3D scene decomposition and manipulation directly from 2D images using neural radiance fields, with effective object separation and editing capabilities.
Contribution
It introduces an object field component with specialized loss functions and an inverse query algorithm for free 3D object manipulation within a unified neural scene representation.
Findings
Accurately decomposes 3D objects from 2D views
Enables free manipulation like translation, rotation, and deformation
Handles object collisions and occlusions explicitly
Abstract
In this paper, we study the problem of 3D scene geometry decomposition and manipulation from 2D views. By leveraging the recent implicit neural representation techniques, particularly the appealing neural radiance fields, we introduce an object field component to learn unique codes for all individual objects in 3D space only from 2D supervision. The key to this component is a series of carefully designed loss functions to enable every 3D point, especially in non-occupied space, to be effectively optimized even without 3D labels. In addition, we introduce an inverse query algorithm to freely manipulate any specified 3D object shape in the learned scene representation. Notably, our manipulation algorithm can explicitly tackle key issues such as object collisions and visual occlusions. Our method, called DM-NeRF, is among the first to simultaneously reconstruct, decompose, manipulate and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Human Pose and Action Recognition
