Deeper Depth Prediction with Fully Convolutional Residual Networks

Iro Laina; Christian Rupprecht; Vasileios Belagiannis; Federico Tombari; Nassir Navab

arXiv:1606.00373·cs.CV·September 20, 2016

TL;DR

This paper introduces a fully convolutional residual network for monocular depth estimation. The network learns feature-map up-sampling internally to raise output resolution, is trained with the reverse Huber loss, and runs in real time with fewer parameters and better accuracy than prior models.

Contribution

The paper presents an end-to-end trainable architecture for monocular depth prediction that combines efficient learned up-sampling with the reverse Huber loss.

Findings

01 Outperforms existing methods on depth estimation benchmarks.

02 Operates in real-time without post-processing.

03 Uses fewer parameters and training data than prior models.

Abstract

This paper addresses the problem of estimating the depth map of a scene given a single RGB image. We propose a fully convolutional architecture, encompassing residual learning, to model the ambiguous mapping between monocular images and depth maps. In order to improve the output resolution, we present a novel way to efficiently learn feature map up-sampling within the network. For optimization, we introduce the reverse Huber loss that is particularly suited for the task at hand and driven by the value distributions commonly present in depth maps. Our model is composed of a single architecture that is trained end-to-end and does not rely on post-processing techniques, such as CRFs or other additional refinement steps. As a result, it runs in real-time on images or videos. In the evaluation, we show that the proposed model contains fewer parameters and requires fewer training data than…
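
The abstract names the reverse Huber (berHu) loss without spelling it out. The sketch below assumes the common definition: a residual is penalized linearly (L1) while its magnitude is at most a threshold c, and quadratically as (x^2 + c^2) / (2c) above it, so the two branches meet at |x| = c. The function names `berhu` and `berhu_mean` are hypothetical, and the default threshold of 20% of the largest absolute residual is an assumption made for illustration, not something stated in the text above.

```python
def berhu(residuals, c=None):
    """Per-element reverse Huber (berHu) penalty for a flat list of residuals.

    |x| <= c : penalty is |x|                  (L1 branch)
    |x| >  c : penalty is (x^2 + c^2) / (2c)   (scaled L2 branch)
    """
    abs_r = [abs(r) for r in residuals]
    if c is None:
        # Assumption: threshold is 20% of the largest absolute residual.
        c = 0.2 * max(abs_r, default=0.0)
        if c == 0.0:
            # Every residual is zero (or the list is empty): zero penalty.
            return [0.0] * len(abs_r)
    elif c <= 0:
        raise ValueError("threshold c must be positive")
    return [a if a <= c else (a * a + c * c) / (2.0 * c) for a in abs_r]


def berhu_mean(pred, target, c=None):
    """Mean berHu penalty between two equal-length lists of depth values."""
    if len(pred) != len(target) or not pred:
        raise ValueError("pred and target must be non-empty and equally long")
    losses = berhu([p - t for p, t in zip(pred, target)], c)
    return sum(losses) / len(losses)
```

Compared with the ordinary Huber loss, the roles of the two branches are swapped: small residuals keep a constant-magnitude L1 gradient, while large residuals receive the stronger L2 penalty. In a training setup the same piecewise rule would be applied to tensors rather than Python lists.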

Taxonomy

Methods

Huber loss