How Far Can I Go ? : A Self-Supervised Approach for Deterministic Video   Depth Forecasting

Sauradip Nag; Nisarg Shah; Anran Qi; Raghavendra Ramachandra

arXiv:2207.00506·cs.CV·July 12, 2022·1 cites

How Far Can I Go ? : A Self-Supervised Approach for Deterministic Video Depth Forecasting

Sauradip Nag, Nisarg Shah, Anran Qi, Raghavendra Ramachandra

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-supervised, deterministic method for predicting future monocular depth in videos, avoiding reliance on large annotated datasets and probabilistic models, by framing depth forecasting as a view synthesis task.

Contribution

It proposes DeFNet, a novel depth forecasting network, and a channel-attention pose estimator, enabling cost-effective, accurate future depth prediction without ground truth depth data.

Findings

01

Outperforms state-of-the-art in Abs Rel metric on KITTI and Cityscapes.

02

Effective for both short-term and mid-term depth forecasting.

03

Demonstrates practical self-supervised depth prediction for unobserved video frames.

Abstract

In this paper we present a novel self-supervised method to anticipate the depth estimate for a future, unobserved real-world urban scene. This work is the first to explore self-supervised learning for estimation of monocular depth of future unobserved frames of a video. Existing works rely on a large number of annotated samples to generate the probabilistic prediction of depth for unseen frames. However, this makes it unrealistic due to its requirement for large amount of annotated depth samples of video. In addition, the probabilistic nature of the case, where one past can have multiple future outcomes often leads to incorrect depth estimates. Unlike previous methods, we model the depth estimation of the unobserved frame as a view-synthesis problem, which treats the depth estimate of the unseen video frame as an auxiliary task while synthesizing back the views using learned pose. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sauradip/depthforecasting
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques