Decomposing 3D Scenes into Objects via Unsupervised Volume Segmentation
Karl Stelzner, Kristian Kersting, Adam R. Kosiorek

TL;DR
ObSuRF is a novel method that decomposes a scene into individual object NeRFs from a single image, enabling unsupervised 3D object segmentation and geometry recovery with improved efficiency.
Contribution
The paper introduces ObSuRF, an unsupervised approach that segments 3D scenes into objects using neural radiance fields conditioned on latent vectors, trained efficiently on RGB-D data without explicit ray marching.
Findings
Achieves state-of-the-art results on 2D segmentation benchmarks.
Successfully recovers 3D geometry and segments objects from a single image.
Performs well on multi-object 3D datasets without supervision.
Abstract
We present ObSuRF, a method which turns a single image of a scene into a 3D model represented as a set of Neural Radiance Fields (NeRFs), with each NeRF corresponding to a different object. A single forward pass of an encoder network outputs a set of latent vectors describing the objects in the scene. These vectors are used independently to condition a NeRF decoder, defining the geometry and appearance of each object. We make learning more computationally efficient by deriving a novel loss, which allows training NeRFs on RGB-D inputs without explicit ray marching. After confirming that the model performs equal or better than state of the art on three 2D image segmentation benchmarks, we apply it to two multi-object 3D datasets: A multiview version of CLEVR, and a novel dataset in which scenes are populated by ShapeNet models. We find that after training ObSuRF on RGB-D views of training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques
MethodsRobinhood Customer Care Number +1-833-534-1729
