Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction
Huangying Zhan, Ravi Garg, Chamara Saroj Weerasekera, Kejie Li, Harsh, Agarwal, Ian Reid

TL;DR
This paper introduces an unsupervised framework that jointly learns monocular depth estimation and visual odometry from stereo sequences, leveraging deep feature reconstruction to improve accuracy and scale consistency.
Contribution
It proposes a novel unsupervised learning approach using stereo sequences and deep feature warping, enhancing depth and odometry estimation without full supervision.
Findings
Joint training improves depth prediction accuracy.
Deep feature-based warping outperforms simple photometric warping.
Method surpasses existing learning-based methods on KITTI dataset.
Abstract
Despite learning based methods showing promising results in single view depth estimation and visual odometry, most existing approaches treat the tasks in a supervised manner. Recent approaches to single view depth estimation explore the possibility of learning without full supervision via minimizing photometric error. In this paper, we explore the use of stereo sequences for learning depth and visual odometry. The use of stereo sequences enables the use of both spatial (between left-right pairs) and temporal (forward backward) photometric warp error, and constrains the scene depth and camera motion to be in a common, real-world scale. At test time our framework is able to estimate single view depth and two-view odometry from a monocular sequence. We also show how we can improve on a standard photometric warp loss by considering a warp of deep features. We show through extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Image Processing Techniques and Applications
