Monocular Depth Parameterizing Networks

Patrik Persson; Linn \"Ostr\"om; Carl Olsson

arXiv:2012.11301·cs.CV·December 22, 2020

Monocular Depth Parameterizing Networks

Patrik Persson, Linn \"Ostr\"om, Carl Olsson

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel neural network approach that combines monocular and stereo depth estimation techniques, resulting in more accurate and geometrically consistent depth maps from single images.

Contribution

It proposes a network that parameterizes depth maps for improved accuracy and geometric consistency, integrating recognition-based and stereo methods.

Findings

01

Produces more accurate depth maps than existing methods

02

Generalizes better across different datasets

03

Enforces geometric properties in depth estimation

Abstract

Monocular depth estimation is a highly challenging problem that is often addressed with deep neural networks. While these are able to use recognition of image features to predict reasonably looking depth maps the result often has low metric accuracy. In contrast traditional stereo methods using multiple cameras provide highly accurate estimation when pixel matching is possible. In this work we propose to combine the two approaches leveraging their respective strengths. For this purpose we propose a network structure that given an image provides a parameterization of a set of depth maps with feasible shapes. Optimizing over the parameterization then allows us to search the shapes for a photo consistent solution with respect to other images. This allows us to enforce geometric properties that are difficult to observe in single image as well as relaxes the learning problem allowing us to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

patrikperssonmath/MDPN
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotic Mechanisms and Dynamics · Manufacturing Process and Optimization