DeepStereo: Learning to Predict New Views from the World's Imagery
John Flynn, Ivan Neulander, James Philbin, Noah Snavely

TL;DR
This paper introduces DeepStereo, a deep learning system that synthesizes new views from posed image sets, offering a general, end-to-end approach that produces high-quality images for complex scenes.
Contribution
It presents the first deep learning method for new view synthesis from real-world imagery, trained end-to-end to generate unseen views directly from pixels.
Findings
Successfully reproduces known test views from nearby images.
Generates high-quality images from novel viewpoints.
Applicable to various domains with minimal tuning.
Abstract
Deep networks have recently enjoyed enormous success when applied to recognition and classification problems in computer vision, but their use in graphics problems has been limited. In this work, we present a novel deep architecture that performs new view synthesis directly from pixels, trained from a large number of posed image sets. In contrast to traditional approaches which consist of multiple complex stages of processing, each of which require careful tuning and can fail in unexpected ways, our system is trained end-to-end. The pixels from neighboring views of a scene are presented to the network which then directly produces the pixels of the unseen view. The benefits of our approach include generality (we only require posed image sets and can easily apply our method to different domains), and high quality results on traditionally difficult scenes. We believe this is due to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
DeepStereo: Learning to Predict New Views From the World’s Imagery· youtube
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Advanced Image Processing Techniques
