# Light Field Video Capture Using a Learning-Based Hybrid Imaging System

**Authors:** Ting-Chun Wang, Jun-Yan Zhu, Nima Khademi Kalantari, Alexei A. Efros,, Ravi Ramamoorthi

arXiv: 1705.02997 · 2017-05-09

## TL;DR

This paper introduces a hybrid imaging system combining light field and standard video cameras, using learning-based methods to generate high-frame-rate light field videos from low-frame-rate light field data and high-frame-rate 2D video.

## Contribution

It presents a novel hybrid camera setup and a deep learning approach for converting sparse light field sequences into full light field videos at 30 fps.

## Key findings

- Outperforms existing video interpolation methods
- Enables consumer-level light field videography
- Facilitates applications like refocusing and parallax view generation

## Abstract

Light field cameras have many advantages over traditional cameras, as they allow the user to change various camera settings after capture. However, capturing light fields requires a huge bandwidth to record the data: a modern light field camera can only take three images per second. This prevents current consumer light field cameras from capturing light field videos. Temporal interpolation at such extreme scale (10x, from 3 fps to 30 fps) is infeasible as too much information will be entirely missing between adjacent frames. Instead, we develop a hybrid imaging system, adding another standard video camera to capture the temporal information. Given a 3 fps light field sequence and a standard 30 fps 2D video, our system can then generate a full light field video at 30 fps. We adopt a learning-based approach, which can be decomposed into two steps: spatio-temporal flow estimation and appearance estimation. The flow estimation propagates the angular information from the light field sequence to the 2D video, so we can warp input images to the target view. The appearance estimation then combines these warped images to output the final pixels. The whole process is trained end-to-end using convolutional neural networks. Experimental results demonstrate that our algorithm outperforms current video interpolation methods, enabling consumer light field videography, and making applications such as refocusing and parallax view generation achievable on videos for the first time.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.02997/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/1705.02997/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/1705.02997/full.md

---
Source: https://tomesphere.com/paper/1705.02997