Unsupervised Monocular Depth Estimation with Left-Right Consistency
Cl\'ement Godard, Oisin Mac Aodha, Gabriel J. Brostow

TL;DR
This paper introduces an unsupervised method for monocular depth estimation using stereo images and epipolar geometry, achieving state-of-the-art results without requiring ground truth depth data during training.
Contribution
It proposes a novel unsupervised training framework that enforces left-right disparity consistency, eliminating the need for explicit depth supervision.
Findings
Achieves state-of-the-art monocular depth estimation on KITTI dataset.
Outperforms some supervised methods trained with ground truth depth.
Demonstrates robustness and improved accuracy over existing unsupervised approaches.
Abstract
Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we innovate beyond existing approaches, replacing the use of explicit depth data during training with easier-to-obtain binocular stereo footage. We propose a novel training objective that enables our convolutional neural network to learn to perform single image depth estimation, despite the absence of ground truth depth data. Exploiting epipolar geometry constraints, we generate disparity images by training our network with an image reconstruction loss. We show that solving for image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Unsupervised Monocular Depth Estimation With Left-Right Consistency· youtube
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image Processing Techniques
