DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization
Cheng Zhang, Zhaopeng Cui, Cai Chen, Shuaicheng Liu, Bing Zeng, Hujun, Bao, Yinda Zhang

TL;DR
DeepPanoContext introduces a novel panoramic 3D scene understanding approach that leverages a holistic scene context graph and relation-based optimization to accurately recover room layouts and object details from a single panorama.
Contribution
The paper proposes a new graph neural network-based context model and a differentiable optimization module for panoramic 3D scene understanding, along with a synthetic dataset for training.
Findings
Outperforms existing methods in geometry accuracy
Achieves better object arrangement results
Demonstrates effectiveness on a new synthetic dataset
Abstract
Panorama images have a much larger field-of-view thus naturally encode enriched scene context information compared to standard perspective images, which however is not well exploited in the previous scene understanding methods. In this paper, we propose a novel method for panoramic 3D scene understanding which recovers the 3D room layout and the shape, pose, position, and semantic category for each object from a single full-view panorama image. In order to fully utilize the rich context information, we design a novel graph neural network based context model to predict the relationship among objects and room layout, and a differentiable relationship-based optimization module to optimize object arrangement with well-designed objective functions on-the-fly. Realizing the existing data are either with incomplete ground truth or overly-simplified scene, we present a new synthetic dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · 3D Surveying and Cultural Heritage
MethodsGraph Neural Network
