A Novel Monocular Disparity Estimation Network with Domain Transformation and Ambiguity Learning
Juan Luis Gonzalez Bello, Munchurl Kim

TL;DR
This paper introduces a new unsupervised monocular disparity estimation network that improves accuracy, reduces parameters, and estimates full disparity maps in a single pass, outperforming previous methods on the KITTI dataset.
Contribution
The paper presents a novel encoder-decoder architecture with domain transformation and ambiguity learning for unsupervised monocular disparity estimation, achieving superior performance with fewer parameters.
Findings
Outperforms Monodepth baseline in all metrics
Reduces model parameters significantly
Estimates full disparity map in a single forward pass
Abstract
Convolutional neural networks (CNN) have shown state-of-the-art results for low-level computer vision problems such as stereo and monocular disparity estimations, but still, have much room to further improve their performance in terms of accuracy, numbers of parameters, etc. Recent works have uncovered the advantages of using an unsupervised scheme to train CNN's to estimate monocular disparity, where only the relatively-easy-to-obtain stereo images are needed for training. We propose a novel encoder-decoder architecture that outperforms previous unsupervised monocular depth estimation networks by (i) taking into account ambiguities, (ii) efficient fusion between encoder and decoder features with rectangular convolutions and (iii) domain transformations between encoder and decoder. Our architecture outperforms the Monodepth baseline in all metrics, even with a considerable reduction of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image Processing Techniques
