Hierarchical Neural Architecture Search for Deep Stereo Matching
Xuelian Cheng, Yiran Zhong, Mehrtash Harandi, Yuchao Dai, Xiaojun, Chang, Tom Drummond, Hongdong Li, Zongyuan Ge

TL;DR
This paper introduces a hierarchical neural architecture search framework tailored for deep stereo matching, significantly improving accuracy, efficiency, and network size by automating the design process with task-specific knowledge.
Contribution
It presents the first end-to-end hierarchical NAS method for stereo matching, jointly optimizing the entire pipeline and outperforming existing handcrafted architectures.
Findings
Achieved top accuracy on KITTI and Middlebury benchmarks.
Reduced network size and inference time.
Outperformed state-of-the-art deep stereo matching networks.
Abstract
To reduce the human efforts in neural network design, Neural Architecture Search (NAS) has been applied with remarkable success to various high-level vision tasks such as classification and semantic segmentation. The underlying idea for the NAS algorithm is straightforward, namely, to enable the network the ability to choose among a set of operations (e.g., convolution with different filter sizes), one is able to find an optimal architecture that is better adapted to the problem at hand. However, so far the success of NAS has not been enjoyed by low-level geometric vision tasks such as stereo matching. This is partly due to the fact that state-of-the-art deep stereo matching networks, designed by humans, are already sheer in size. Directly applying the NAS to such massive structures is computationally prohibitive based on the currently available mainstream computing resources. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
MethodsConvolution
