SparseGNV: Generating Novel Views of Indoor Scenes with Sparse Input   Views

Weihao Cheng; Yan-Pei Cao; Ying Shan

arXiv:2305.07024·cs.CV·May 12, 2023·1 cites

SparseGNV: Generating Novel Views of Indoor Scenes with Sparse Input Views

Weihao Cheng, Yan-Pei Cao, Ying Shan

PDF

Open Access 1 Repo

TL;DR

SparseGNV is a novel framework that generates photorealistic indoor scene views from sparse inputs by combining 3D geometry, transformer-based decoding, and learned priors, outperforming existing methods.

Contribution

It introduces a three-module learning framework integrating neural point clouds, transformers, and image reconstruction for efficient view synthesis.

Findings

01

Outperforms state-of-the-art methods on real-world indoor scenes.

02

Efficient feed-forward generation of novel views of unseen scenes.

03

Effective use of 3D structures and generative models for view synthesis.

Abstract

We study to generate novel views of indoor scenes given sparse input views. The challenge is to achieve both photorealism and view consistency. We present SparseGNV: a learning framework that incorporates 3D structures and image generative models to generate novel views with three modules. The first module builds a neural point cloud as underlying geometry, providing contextual information and guidance for the target novel view. The second module utilizes a transformer-based network to map the scene context and the guidance into a shared latent space and autoregressively decodes the target view in the form of discrete image tokens. The third module reconstructs the tokens into the image of the target view. SparseGNV is trained across a large indoor scene dataset to learn generalizable priors. Once trained, it can efficiently generate novel views of an unseen indoor scene in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xt4d/sparsegnv
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRemote Sensing and LiDAR Applications · 3D Surveying and Cultural Heritage · Advanced Vision and Imaging