IBRNet: Learning Multi-View Image-Based Rendering

Qianqian Wang; Zhicheng Wang; Kyle Genova; Pratul Srinivasan; Howard; Zhou; Jonathan T. Barron; Ricardo Martin-Brualla; Noah Snavely; Thomas; Funkhouser

arXiv:2102.13090·cs.CV·April 8, 2021·27 cites

IBRNet: Learning Multi-View Image-Based Rendering

Qianqian Wang, Zhicheng Wang, Kyle Genova, Pratul Srinivasan, Howard, Zhou, Jonathan T. Barron, Ricardo Martin-Brualla, Noah Snavely, Thomas, Funkhouser

PDF

Open Access 1 Repo

TL;DR

IBRNet is a neural network architecture that synthesizes novel views of complex scenes by interpolating sparse source views, combining classic image-based rendering principles with modern neural techniques for high-quality, generalizable view synthesis.

Contribution

We introduce a neural network with a ray transformer that estimates radiance and density from multiple views, enabling generalizable, high-resolution novel view synthesis from sparse inputs.

Findings

01

Outperforms recent generalizable view synthesis methods.

02

Competitive with single-scene neural rendering when fine-tuned.

03

Effective volume rendering trained with multi-view posed images.

Abstract

We present a method that synthesizes novel views of complex scenes by interpolating a sparse set of nearby views. The core of our method is a network architecture that includes a multilayer perceptron and a ray transformer that estimates radiance and volume density at continuous 5D locations (3D spatial locations and 2D viewing directions), drawing appearance information on the fly from multiple source views. By drawing on source views at render time, our method hearkens back to classic work on image-based rendering (IBR), and allows us to render high-resolution imagery. Unlike neural scene representation work that optimizes per-scene functions for rendering, we learn a generic view interpolation function that generalizes to novel scenes. We render images using classic volume rendering, which is fully differentiable and allows us to train using only multi-view posed images as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

googleinterns/ibrnet
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis

MethodsRobinhood Customer Care Number +1-833-534-1729