# Monocular Depth Estimation with Directional Consistency by Deep Networks

**Authors:** Fabian Truetsch, Alfred Sch\"ottl

arXiv: 1905.04467 · 2019-05-14

## TL;DR

This paper introduces a monocular depth estimation model that combines structure from motion and stereo disparity, using viewpoint transformations and consistency constraints to improve accuracy in depth prediction.

## Contribution

The proposed approach uniquely integrates structure from motion with stereo disparity and employs viewpoint-based supervision, enhancing depth estimation accuracy over existing methods.

## Key findings

- Outperforms individual components in depth accuracy
- Utilizes viewpoint transformations for supervision
- Increases accuracy by combining two approaches

## Abstract

As processing power has become more available, more human-like artificial intelligences are created to solve image processing tasks that we are inherently good at. As such we propose a model that estimates depth from a monocular image. Our approach utilizes a combination of structure from motion and stereo disparity. We estimate a pose between the source image and a different viewpoint and a dense depth map and use a simple transformation to reconstruct the image seen from said viewpoint. We can then use the real image at that viewpoint to act as supervision to train out model. The metric chosen for image comparison employs standard L1 and structural similarity and a consistency constraint between depth maps as well as smoothness constraint. We show that similar to human perception utilizing the correlation within the provided data by two different approaches increases the accuracy and outperforms the individual components.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.04467/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1905.04467/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/1905.04467/full.md

---
Source: https://tomesphere.com/paper/1905.04467