AMNet: Deep Atrous Multiscale Stereo Disparity Estimation Networks

Xianzhi Du; Mostafa El-Khamy; Jungwon Lee

arXiv:1904.09099·cs.CV·April 22, 2019·49 cites

AMNet: Deep Atrous Multiscale Stereo Disparity Estimation Networks

Xianzhi Du, Mostafa El-Khamy, Jungwon Lee

PDF

Open Access

TL;DR

This paper introduces AMNet, a deep multiscale stereo disparity estimation network utilizing atrous convolutions and an extended cost volume, achieving state-of-the-art accuracy on multiple benchmarks.

Contribution

It presents a novel deep learning architecture with multiscale feature extraction and a foreground-background aware variant, trained via an iterative multitask method.

Findings

01

Achieves high accuracy on Middlebury, KITTI, and Sceneflow benchmarks.

02

Outperforms previous methods in stereo disparity estimation.

03

Demonstrates effective multiscale contextual information aggregation.

Abstract

In this paper, a new deep learning architecture for stereo disparity estimation is proposed. The proposed atrous multiscale network (AMNet) adopts an efficient feature extractor with depthwise-separable convolutions and an extended cost volume that deploys novel stereo matching costs on the deep features. A stacked atrous multiscale network is proposed to aggregate rich multiscale contextual information from the cost volume which allows for estimating the disparity with high accuracy at multiple scales. AMNet can be further modified to be a foreground-background aware network, FBA-AMNet, which is capable of discriminating between the foreground and the background objects in the scene at multiple scales. An iterative multitask learning method is proposed to train FBA-AMNet end-to-end. The proposed disparity estimation networks, AMNet and FBA-AMNet, show accurate disparity estimates and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques