Depth estimation from 4D light field videos

Takahiro Kinoshita; Satoshi Ono

arXiv:2012.03021·cs.CV·July 9, 2024

Depth estimation from 4D light field videos

Takahiro Kinoshita, Satoshi Ono

PDF

1 Repo

TL;DR

This paper introduces an end-to-end neural network for depth estimation from 4D light field videos, leveraging temporal information to improve accuracy, and provides a synthetic dataset for training and evaluation.

Contribution

It presents the first neural network architecture specifically designed for depth estimation from 4D LF videos, incorporating temporal data and offering a new synthetic dataset.

Findings

01

Temporal information improves depth accuracy in noisy regions.

02

The proposed method outperforms static image-based approaches.

03

Synthetic and real-world experiments validate effectiveness.

Abstract

Depth (disparity) estimation from 4D Light Field (LF) images has been a research topic for the last couple of years. Most studies have focused on depth estimation from static 4D LF images while not considering temporal information, i.e., LF videos. This paper proposes an end-to-end neural network architecture for depth estimation from 4D LF videos. This study also constructs a medium-scale synthetic 4D LF video dataset that can be used for training deep learning-based methods. Experimental results using synthetic and real-world 4D LF videos show that temporal information contributes to the improvement of depth estimation accuracy in noisy regions. Dataset and code is available at: https://mediaeng-lfv.github.io/LFV_Disparity_Estimation

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mediaeng-lfv/LFV_Disparity_Estimation
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvLSTM · Convolution · 3D Convolution