Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects
Bangbang Yang, Yinda Zhang, Yijin Li, Zhaopeng Cui, Sean Fanello,, Hujun Bao, Guofeng Zhang

TL;DR
This paper introduces a novel neural rendering framework for amodal 3D scene understanding in closed environments, enabling realistic free-viewpoint rendering from a single panoramic image, with applications in scene touring and editing.
Contribution
The work presents a two-stage approach combining offline prior learning and online scene understanding with neural rendering, improving robustness and realism in free-viewpoint scene synthesis.
Findings
Outperforms existing methods on synthetic and real datasets
Achieves robust 3D scene understanding from a single panoramic image
Enables realistic free-viewpoint rendering for scene editing and touring
Abstract
We, as human beings, can understand and picture a familiar scene from arbitrary viewpoints given a single image, whereas this is still a grand challenge for computers. We hereby present a novel solution to mimic such human perception capability based on a new paradigm of amodal 3D scene understanding with neural rendering for a closed scene. Specifically, we first learn the prior knowledge of the objects in a closed scene via an offline stage, which facilitates an online stage to understand the room with unseen furniture arrangement. During the online stage, given a panoramic image of the scene in different layouts, we utilize a holistic neural-rendering-based optimization framework to efficiently estimate the correct 3D scene layout and deliver realistic free-viewpoint rendering. In order to handle the domain gap between the offline and online stage, our method exploits compositional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
