Direct Depth Learning Network for Stereo Matching
Hong Zhang, Haojie Li, Shenglun Chen, Tiantian Yan, Zhihui, Wang, Guo Lu, Wanli Ouyang

TL;DR
This paper introduces DDL-Net, a novel stereo matching network that directly estimates depth using a two-stage process, improving accuracy especially for distant points by considering depth range and uncertainty.
Contribution
The paper proposes a two-stage depth-focused stereo matching network with depth supervision and a novel Granularity Uncertainty mechanism for improved distant depth estimation.
Findings
25% improvement on SceneFlow dataset
12% improvement on DrivingStereo dataset
State-of-the-art accuracy at large distances
Abstract
Being a crucial task of autonomous driving, Stereo matching has made great progress in recent years. Existing stereo matching methods estimate disparity instead of depth. They treat the disparity errors as the evaluation metric of the depth estimation errors, since the depth can be calculated from the disparity according to the triangulation principle. However, we find that the error of the depth depends not only on the error of the disparity but also on the depth range of the points. Therefore, even if the disparity error is low, the depth error is still large, especially for the distant points. In this paper, a novel Direct Depth Learning Network (DDL-Net) is designed for stereo matching. DDL-Net consists of two stages: the Coarse Depth Estimation stage and the Adaptive-Grained Depth Refinement stage, which are all supervised by depth instead of disparity. Specifically, Coarse Depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques
