Shift Convolution Network for Stereo Matching

Jian Xie

arXiv:1911.08896·cs.CV·November 21, 2019

Shift Convolution Network for Stereo Matching

Jian Xie

PDF

Open Access

TL;DR

This paper introduces ShiftConvNet, a fast and accurate stereo matching network that replaces traditional correlation with shift convolution layers, achieving state-of-the-art results on benchmark datasets.

Contribution

The paper proposes a novel shift convolution layer and architecture for stereo matching, improving speed and accuracy over existing methods.

Findings

01

Achieves state-of-the-art results on FlyingThings 3D dataset.

02

Runs at 5 fps, faster than comparable methods.

03

Improves disparity estimation accuracy with auto shift convolution refinement.

Abstract

In this paper, we present Shift Convolution Network (ShiftConvNet) to provide matching capability between two feature maps for stereo estimation. The proposed method can speedily produce a highly accurate disparity map from stereo images. A module called shift convolution layer is proposed to replace the traditional correlation layer to perform patch comparisons between two feature maps. By using a novel architecture of convolutional network to learn the matching process, ShiftConvNet can produce better results than DispNet-C[1], also running faster with 5 fps. Moreover, with a proposed auto shift convolution refine part, further improvement is obtained. The proposed approach was evaluated on FlyingThings 3D. It achieves state-of-the-art results on the benchmark dataset. Codes will be made available at github.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization

MethodsConvolution