AMNet: Deep Atrous Multiscale Stereo Disparity Estimation Networks
Xianzhi Du, Mostafa El-Khamy, Jungwon Lee

TL;DR
This paper introduces AMNet, a deep multiscale stereo disparity estimation network utilizing atrous convolutions and an extended cost volume, achieving state-of-the-art accuracy on multiple benchmarks.
Contribution
It presents a novel deep learning architecture with multiscale feature extraction and a foreground-background aware variant, trained via an iterative multitask method.
Findings
Achieves high accuracy on Middlebury, KITTI, and Sceneflow benchmarks.
Outperforms previous methods in stereo disparity estimation.
Demonstrates effective multiscale contextual information aggregation.
Abstract
In this paper, a new deep learning architecture for stereo disparity estimation is proposed. The proposed atrous multiscale network (AMNet) adopts an efficient feature extractor with depthwise-separable convolutions and an extended cost volume that deploys novel stereo matching costs on the deep features. A stacked atrous multiscale network is proposed to aggregate rich multiscale contextual information from the cost volume which allows for estimating the disparity with high accuracy at multiple scales. AMNet can be further modified to be a foreground-background aware network, FBA-AMNet, which is capable of discriminating between the foreground and the background objects in the scene at multiple scales. An iterative multitask learning method is proposed to train FBA-AMNet end-to-end. The proposed disparity estimation networks, AMNet and FBA-AMNet, show accurate disparity estimates and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques
