P2I-NET: Mapping Camera Pose to Image via Adversarial Learning for New View Synthesis in Real Indoor Environments
Xujie Kang, Kanglin Liu, Jiang Duan, Yuanhao Gong, Guoping, Qiu

TL;DR
This paper introduces P2I-NET, a generative adversarial network that efficiently synthesizes new indoor views from given camera poses, outperforming existing methods in speed while maintaining high image quality.
Contribution
The paper presents a novel adversarial network architecture with auxiliary constraints for accurate view synthesis from camera poses in indoor environments, along with a new high-resolution RGBD dataset.
Findings
P2I-NET outperforms NeRF-based models in image quality.
P2I-NET is 40 to 100 times faster than baseline methods.
A new high-resolution indoor RGBD dataset is provided.
Abstract
Given a new camera pose in an indoor environment, we study the challenging problem of predicting the view from that pose based on a set of reference RGBD views. Existing explicit or implicit 3D geometry construction methods are computationally expensive while those based on learning have predominantly focused on isolated views of object categories with regular geometric structure. Differing from the traditional \textit{render-inpaint} approach to new view synthesis in the real indoor environment, we propose a conditional generative adversarial neural network (P2I-NET) to directly predict the new view from the given pose. P2I-NET learns the conditional distribution of the images of the environment for establishing the correspondence between the camera pose and its view of the environment, and achieves this through a number of innovative designs in its architecture and training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
