Multi-scale Iterative Residuals for Fast and Scalable Stereo Matching
Kumail Raza, Ren\'e Schuster, Didier Stricker

TL;DR
This paper introduces an iterative multi-scale refinement framework for stereo matching that significantly improves speed and scalability while maintaining high accuracy, enabling real-time performance on high-resolution inputs.
Contribution
The paper proposes a novel multi-scale iterative refinement approach that enhances existing stereo networks for faster, scalable, and accurate stereo matching.
Findings
49× faster inference than GANetdeep
4× less memory consumption with comparable error
Scalable to 6K resolution with real-time inference
Abstract
Despite the remarkable progress of deep learning in stereo matching, there exists a gap in accuracy between real-time models and slower state-of-the-art models which are suitable for practical applications. This paper presents an iterative multi-scale coarse-to-fine refinement (iCFR) framework to bridge this gap by allowing it to adopt any stereo matching network to make it fast, more efficient and scalable while keeping comparable accuracy. To reduce the computational cost of matching, we use multi-scale warped features to estimate disparity residuals and push the disparity search range in the cost volume to a minimum limit. Finally, we apply a refinement network to recover the loss of precision which is inherent in multi-scale approaches. We test our iCFR framework by adopting the matching networks from state-of-the art GANet and AANet. The result is 49 faster inference time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Advanced Image and Video Retrieval Techniques
MethodsTest
