How Far Can I Go ? : A Self-Supervised Approach for Deterministic Video Depth Forecasting
Sauradip Nag, Nisarg Shah, Anran Qi, Raghavendra Ramachandra

TL;DR
This paper introduces a self-supervised, deterministic method for predicting future monocular depth in videos, avoiding reliance on large annotated datasets and probabilistic models, by framing depth forecasting as a view synthesis task.
Contribution
It proposes DeFNet, a novel depth forecasting network, and a channel-attention pose estimator, enabling cost-effective, accurate future depth prediction without ground truth depth data.
Findings
Outperforms state-of-the-art in Abs Rel metric on KITTI and Cityscapes.
Effective for both short-term and mid-term depth forecasting.
Demonstrates practical self-supervised depth prediction for unobserved video frames.
Abstract
In this paper we present a novel self-supervised method to anticipate the depth estimate for a future, unobserved real-world urban scene. This work is the first to explore self-supervised learning for estimation of monocular depth of future unobserved frames of a video. Existing works rely on a large number of annotated samples to generate the probabilistic prediction of depth for unseen frames. However, this makes it unrealistic due to its requirement for large amount of annotated depth samples of video. In addition, the probabilistic nature of the case, where one past can have multiple future outcomes often leads to incorrect depth estimates. Unlike previous methods, we model the depth estimation of the unobserved frame as a view-synthesis problem, which treats the depth estimate of the unseen video frame as an auxiliary task while synthesizing back the views using learned pose. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques
