Monocular Depth Parameterizing Networks
Patrik Persson, Linn \"Ostr\"om, Carl Olsson

TL;DR
This paper introduces a novel neural network approach that combines monocular and stereo depth estimation techniques, resulting in more accurate and geometrically consistent depth maps from single images.
Contribution
It proposes a network that parameterizes depth maps for improved accuracy and geometric consistency, integrating recognition-based and stereo methods.
Findings
Produces more accurate depth maps than existing methods
Generalizes better across different datasets
Enforces geometric properties in depth estimation
Abstract
Monocular depth estimation is a highly challenging problem that is often addressed with deep neural networks. While these are able to use recognition of image features to predict reasonably looking depth maps the result often has low metric accuracy. In contrast traditional stereo methods using multiple cameras provide highly accurate estimation when pixel matching is possible. In this work we propose to combine the two approaches leveraging their respective strengths. For this purpose we propose a network structure that given an image provides a parameterization of a set of depth maps with feasible shapes. Optimizing over the parameterization then allows us to search the shapes for a photo consistent solution with respect to other images. This allows us to enforce geometric properties that are difficult to observe in single image as well as relaxes the learning problem allowing us to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotic Mechanisms and Dynamics · Manufacturing Process and Optimization
