SCV-Stereo: Learning Stereo Matching from a Sparse Cost Volume

Hengli Wang; Rui Fan; Ming Liu

arXiv:2107.08187·cs.CV·July 20, 2021

SCV-Stereo: Learning Stereo Matching from a Sparse Cost Volume

Hengli Wang, Rui Fan, Ming Liu

PDF

Open Access

TL;DR

SCV-Stereo introduces a CNN architecture that learns stereo matching efficiently from sparse cost volumes, reducing computational load while maintaining high accuracy, as demonstrated on KITTI benchmarks.

Contribution

The paper presents a novel CNN model that replaces dense cost volumes with sparse ones, improving efficiency without sacrificing accuracy in stereo matching.

Findings

01

Significantly reduces computational and memory requirements.

02

Achieves competitive accuracy on KITTI benchmarks.

03

Enables iterative disparity updates for improved results.

Abstract

Convolutional neural network (CNN)-based stereo matching approaches generally require a dense cost volume (DCV) for disparity estimation. However, generating such cost volumes is computationally-intensive and memory-consuming, hindering CNN training and inference efficiency. To address this problem, we propose SCV-Stereo, a novel CNN architecture, capable of learning dense stereo matching from sparse cost volume (SCV) representations. Our inspiration is derived from the fact that DCV representations are somewhat redundant and can be replaced with SCV representations. Benefiting from these SCV representations, our SCV-Stereo can update disparity estimations in an iterative fashion for accurate and efficient stereo matching. Extensive experiments carried out on the KITTI Stereo benchmarks demonstrate that our SCV-Stereo can significantly minimize the trade-off between accuracy and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Image Processing Techniques and Applications