A Novel Monocular Disparity Estimation Network with Domain   Transformation and Ambiguity Learning

Juan Luis Gonzalez Bello; Munchurl Kim

arXiv:1903.08514·eess.IV·March 22, 2019·5 cites

A Novel Monocular Disparity Estimation Network with Domain Transformation and Ambiguity Learning

Juan Luis Gonzalez Bello, Munchurl Kim

PDF

Open Access

TL;DR

This paper introduces a new unsupervised monocular disparity estimation network that improves accuracy, reduces parameters, and estimates full disparity maps in a single pass, outperforming previous methods on the KITTI dataset.

Contribution

The paper presents a novel encoder-decoder architecture with domain transformation and ambiguity learning for unsupervised monocular disparity estimation, achieving superior performance with fewer parameters.

Findings

01

Outperforms Monodepth baseline in all metrics

02

Reduces model parameters significantly

03

Estimates full disparity map in a single forward pass

Abstract

Convolutional neural networks (CNN) have shown state-of-the-art results for low-level computer vision problems such as stereo and monocular disparity estimations, but still, have much room to further improve their performance in terms of accuracy, numbers of parameters, etc. Recent works have uncovered the advantages of using an unsupervised scheme to train CNN's to estimate monocular disparity, where only the relatively-easy-to-obtain stereo images are needed for training. We propose a novel encoder-decoder architecture that outperforms previous unsupervised monocular depth estimation networks by (i) taking into account ambiguities, (ii) efficient fusion between encoder and decoder features with rectangular convolutions and (iii) domain transformations between encoder and decoder. Our architecture outperforms the Monodepth baseline in all metrics, even with a considerable reduction of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image Processing Techniques