GA-Net: Guided Aggregation Net for End-to-end Stereo Matching

Feihu Zhang; Victor Prisacariu; Ruigang Yang; Philip H.S. Torr

arXiv:1904.06587·cs.CV·April 16, 2019·47 cites

GA-Net: Guided Aggregation Net for End-to-end Stereo Matching

Feihu Zhang, Victor Prisacariu, Ruigang Yang, Philip H.S. Torr

PDF

Open Access 3 Repos

TL;DR

GA-Net introduces two novel neural network layers for cost aggregation in stereo matching, replacing costly 3D convolutions, leading to improved accuracy and efficiency on benchmark datasets.

Contribution

The paper proposes semi-global and local guided aggregation layers that enhance stereo matching accuracy while reducing computational complexity.

Findings

01

Outperforms state-of-the-art GC-Net with fewer layers

02

Achieves better accuracy on Scene Flow and KITTI datasets

03

Reduces computational cost compared to 3D convolutional approaches

Abstract

In the stereo matching task, matching cost aggregation is crucial in both traditional methods and deep neural network models in order to accurately estimate disparities. We propose two novel neural net layers, aimed at capturing local and the whole-image cost dependencies respectively. The first is a semi-global aggregation layer which is a differentiable approximation of the semi-global matching, the second is the local guided aggregation layer which follows a traditional cost filtering strategy to refine thin structures. These two layers can be used to replace the widely used 3D convolutional layer which is computationally costly and memory-consuming as it has cubic computational/memory complexity. In the experiments, we show that nets with a two-layer guided aggregation block easily outperform the state-of-the-art GC-Net which has nineteen 3D convolutional layers. We also train a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques