OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline
Xianda Guo, Chenming Zhang, Juntao Lu, Yiqun Duan, Yiqi Wang, Tian Yang, Zheng Zhu, Long Chen

TL;DR
OpenStereo provides a comprehensive, flexible benchmark and toolbox for stereo matching, enabling evaluation and development of models with state-of-the-art performance and strong generalization across datasets.
Contribution
The paper introduces OpenStereo, a complete stereo matching benchmark and codebase, along with a strong baseline model, StereoBase, that achieves top performance and generalization.
Findings
StereoBase ranks 1st on SceneFlow and KITTI benchmarks.
OpenStereo includes training and inference code for more than 10 network models.
StereoBase demonstrates strong cross-dataset generalization.
Abstract
Stereo matching aims to estimate the disparity between matching pixels in a stereo image pair, which is important for robotics, autonomous driving, and other computer vision tasks. Despite the development of numerous impressive methods in recent years, determining the most suitable architecture for practical application remains challenging. To address this gap, our paper introduces a comprehensive benchmark that focuses on practical applicability rather than solely on optimizing the performance of individual models. Specifically, we develop a flexible and efficient stereo matching codebase called OpenStereo. OpenStereo includes training and inference code for more than 10 network models, making it, to our knowledge, the most complete stereo matching toolbox available. Based on OpenStereo, we conducted experiments and have achieved or surpassed the performance metrics reported in the original…
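To make the notion of disparity in the abstract concrete, here is a minimal sketch of classical block matching on a single rectified scanline. It is not OpenStereo's or StereoBase's method; the function name `scanline_disparity`, its parameters, and the toy intensity values are assumptions chosen purely for illustration. For each pixel in the left row, it searches for the horizontal shift d that minimizes the sum of absolute differences (SAD) against the right row.

```python
def scanline_disparity(left, right, max_disp, half_window=1):
    """Estimate per-pixel disparity along one rectified scanline.

    Illustrative only: a hypothetical helper using SAD block matching,
    not part of the OpenStereo codebase.
    """
    width = len(left)
    disparities = []
    for x in range(width):
        best_d, best_cost = 0, float("inf")
        # A left pixel at column x is assumed to match the right pixel at x - d.
        for d in range(max_disp + 1):
            cost = 0
            for k in range(-half_window, half_window + 1):
                # Clamp indices so windows near the border stay in range.
                xl = min(max(x + k, 0), width - 1)
                xr = min(max(x + k - d, 0), width - 1)
                cost += abs(left[xl] - right[xr])
            # Strict comparison keeps the smallest d on ties.
            if cost < best_cost:
                best_cost, best_d = cost, d
        disparities.append(best_d)
    return disparities


# Toy rows: the left row is the right row shifted two pixels to the right.
right_row = [0, 0, 10, 50, 90, 20, 0, 0, 0, 0]
left_row = [0, 0, 0, 0, 10, 50, 90, 20, 0, 0]
disparity = scanline_disparity(left_row, right_row, max_disp=3)
print(disparity[4:7])
```

In the textured middle of the toy rows the recovered disparity equals the two-pixel shift; in the flat regions every shift costs the same, which is one reason learned methods such as those collected in the benchmark are preferred in practice.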
Taxonomy
Topics: Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Image Processing Techniques and Applications
