SCV-Stereo: Learning Stereo Matching from a Sparse Cost Volume
Hengli Wang, Rui Fan, Ming Liu

TL;DR
SCV-Stereo introduces a CNN architecture that learns stereo matching efficiently from sparse cost volumes, reducing computational load while maintaining high accuracy, as demonstrated on KITTI benchmarks.
Contribution
The paper presents a novel CNN model that replaces dense cost volumes with sparse ones, improving efficiency without sacrificing accuracy in stereo matching.
Findings
Significantly reduces computational and memory requirements.
Achieves competitive accuracy on KITTI benchmarks.
Enables iterative disparity updates for improved results.
Abstract
Convolutional neural network (CNN)-based stereo matching approaches generally require a dense cost volume (DCV) for disparity estimation. However, generating such cost volumes is computationally-intensive and memory-consuming, hindering CNN training and inference efficiency. To address this problem, we propose SCV-Stereo, a novel CNN architecture, capable of learning dense stereo matching from sparse cost volume (SCV) representations. Our inspiration is derived from the fact that DCV representations are somewhat redundant and can be replaced with SCV representations. Benefiting from these SCV representations, our SCV-Stereo can update disparity estimations in an iterative fashion for accurate and efficient stereo matching. Extensive experiments carried out on the KITTI Stereo benchmarks demonstrate that our SCV-Stereo can significantly minimize the trade-off between accuracy and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Image Processing Techniques and Applications
