Deep Learning based Novel View Synthesis
Amit More, Subhasis Chaudhuri

TL;DR
This paper introduces a deep CNN that predicts novel scene views from varying numbers of input images, explicitly models depth and occlusion, and combines multiple predictions for improved quality, demonstrating competitive results.
Contribution
It presents a flexible deep learning model capable of handling different input image counts and explicitly estimating depth and occlusion for novel view synthesis.
Findings
Achieves competitive results on multiple datasets.
Handles varying numbers of input images.
Improves view quality with multi-resolution analysis.
Abstract
Predicting novel views of a scene from real-world images has always been a challenging task. In this work, we propose a deep convolutional neural network (CNN) which learns to predict novel views of a scene from given collection of images. In comparison to prior deep learning based approaches, which can handle only a fixed number of input images to predict novel view, proposed approach works with different numbers of input images. The proposed model explicitly performs feature extraction and matching from a given pair of input images and estimates, at each pixel, the probability distribution (pdf) over possible depth levels in the scene. This pdf is then used for estimating the novel view. The model estimates multiple predictions of novel view, one estimate per input image pair, from given image collection. The model also estimates an occlusion mask and combines multiple novel view…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Video Surveillance and Tracking Methods · Image Enhancement Techniques
