Efficient and Scalable View Generation from a Single Image using Fully   Convolutional Networks

Sung-Ho Bae; Mohamed Elgharib; Mohamed Hefeeda; Wojciech Matusik

arXiv:1705.03737·cs.CV·September 6, 2019·2 cites

Efficient and Scalable View Generation from a Single Image using Fully Convolutional Networks

Sung-Ho Bae, Mohamed Elgharib, Mohamed Hefeeda, Wojciech Matusik

PDF

Open Access

TL;DR

This paper introduces two fully convolutional network architectures for single-image view generation that significantly improve accuracy, speed, and memory efficiency over existing methods, enabling scalable 3D content creation.

Contribution

It proposes novel FCN-based architectures for SIVG that outperform prior CNN-based methods in accuracy and efficiency, and provides a large dataset for training.

Findings

01

DeepView$_{ren}$ is the fastest with 5x speed and lowest memory use.

02

DeepView$_{dec}$ achieves higher accuracy with 2.5x faster speed.

03

Both architectures outperform state-of-the-art methods in accuracy and efficiency.

Abstract

Single-image-based view generation (SIVG) is important for producing 3D stereoscopic content. Here, handling different spatial resolutions as input and optimizing both reconstruction accuracy and processing speed is desirable. Latest approaches are based on convolutional neural network (CNN), and they generate promising results. However, their use of fully connected layers as well as pre-trained VGG forces a compromise between reconstruction accuracy and processing speed. In addition, this approach is limited to the use of a specific spatial resolution. To remedy these problems, we propose exploiting fully convolutional networks (FCN) for SIVG. We present two FCN architectures for SIVG. The first one is based on combination of an FCN and a view-rendering network called DeepView $_{r e n}$ . The second one consists of decoupled networks for luminance and chrominance signals, denoted by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Dropout · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Softmax · Convolution · Ethereum Customer Service Number +1-833-534-1729 · Fully Convolutional Network