Do End-to-end Stereo Algorithms Under-utilize Information?

Changjiang Cai; Philippos Mordohai

arXiv:2010.07350·cs.CV·October 16, 2020

Do End-to-end Stereo Algorithms Under-utilize Information?

Changjiang Cai, Philippos Mordohai

PDF

Open Access 1 Repo

TL;DR

This paper enhances end-to-end stereo matching networks by integrating adaptive filtering and semi-global aggregation, utilizing RGB information to improve disparity accuracy, especially near occlusions and thin structures.

Contribution

It introduces a method to incorporate deep adaptive filtering and differentiable semi-global aggregation into existing stereo networks, leveraging RGB cues for better disparity estimation.

Findings

01

Significant accuracy improvements on KITTI datasets.

02

Enhanced handling of occlusion boundaries and thin structures.

03

Effective integration across multiple existing architectures.

Abstract

Deep networks for stereo matching typically leverage 2D or 3D convolutional encoder-decoder architectures to aggregate cost and regularize the cost volume for accurate disparity estimation. Due to content-insensitive convolutions and down-sampling and up-sampling operations, these cost aggregation mechanisms do not take full advantage of the information available in the images. Disparity maps suffer from over-smoothing near occlusion boundaries, and erroneous predictions in thin structures. In this paper, we show how deep adaptive filtering and differentiable semi-global aggregation can be integrated in existing 2D and 3D convolutional networks for end-to-end stereo matching, leading to improved accuracy. The improvements are due to utilizing RGB information from the images as a signal to dynamically guide the matching process, in addition to being the signal we attempt to match across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ccj5351/DAFStereoNets
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques

Methods1x1 Convolution · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Softmax · Layer Normalization · Convolution · Global Context Block · GCNet