On the Benefit of Adversarial Training for Monocular Depth Estimation

Rick Groenendijk; Sezer Karaoglu; Theo Gevers; Thomas Mensink

arXiv:1910.13340·eess.IV·October 30, 2019

On the Benefit of Adversarial Training for Monocular Depth Estimation

Rick Groenendijk, Sezer Karaoglu, Theo Gevers, Thomas Mensink

PDF

1 Repo

TL;DR

This paper investigates the impact of adversarial training on monocular depth estimation, finding it beneficial only with less constrained reconstruction losses, and achieves state-of-the-art results with specific training strategies.

Contribution

It provides a comprehensive evaluation of adversarial training methods in monocular depth estimation and identifies conditions under which they improve performance.

Findings

01

Adversarial training benefits are limited to less constrained reconstruction losses.

02

Non-adversarial methods outperform GAN-based methods with constrained losses.

03

State-of-the-art results are achieved using batch normalization and multi-scale outputs.

Abstract

In this paper we address the benefit of adding adversarial training to the task of monocular depth estimation. A model can be trained in a self-supervised setting on stereo pairs of images, where depth (disparities) are an intermediate result in a right-to-left image reconstruction pipeline. For the quality of the image reconstruction and disparity prediction, a combination of different losses is used, including L1 image reconstruction losses and left-right disparity smoothness. These are local pixel-wise losses, while depth prediction requires global consistency. Therefore, we extend the self-supervised network to become a Generative Adversarial Network (GAN), by including a discriminator which should tell apart reconstructed (fake) images from real images. We evaluate Vanilla GANs, LSGANs and Wasserstein GANs in combination with different pixel-wise reconstruction losses. Based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rickgroen/depthgan
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution · Dogecoin Customer Service Number +1-833-534-1729