Panoptic Neural Fields: A Semantic Object-Aware Neural Scene   Representation

Abhijit Kundu; Kyle Genova; Xiaoqi Yin; Alireza Fathi; Caroline; Pantofaru; Leonidas Guibas; Andrea Tagliasacchi; Frank Dellaert; Thomas; Funkhouser

arXiv:2205.04334·cs.CV·May 10, 2022·1 cites

Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation

Abhijit Kundu, Kyle Genova, Xiaoqi Yin, Alireza Fathi, Caroline, Pantofaru, Leonidas Guibas, Andrea Tagliasacchi, Frank Dellaert, Thomas, Funkhouser

PDF

Open Access

TL;DR

Panoptic Neural Fields (PNF) is a neural scene representation that decomposes scenes into objects and background, enabling tasks like view synthesis, segmentation, and editing from color images.

Contribution

Introduces PNF, an object-aware neural scene model that efficiently represents scenes with object-specific MLPs and background, leveraging meta-learning and self-supervision.

Findings

01

Effective for novel view synthesis

02

Accurate 2D panoptic segmentation

03

Supports 3D scene editing and depth prediction

Abstract

We present Panoptic Neural Fields (PNF), an object-aware neural scene representation that decomposes a scene into a set of objects (things) and background (stuff). Each object is represented by an oriented 3D bounding box and a multi-layer perceptron (MLP) that takes position, direction, and time and outputs density and radiance. The background stuff is represented by a similar MLP that additionally outputs semantic labels. Each object MLPs are instance-specific and thus can be smaller and faster than previous object-aware approaches, while still leveraging category-specific priors incorporated via meta-learned initialization. Our model builds a panoptic radiance field representation of any scene from just color images. We use off-the-shelf algorithms to predict camera poses, object tracks, and 2D image semantic segmentations. Then we jointly optimize the MLP weights and bounding box…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis