TL;DR
This paper introduces a lightweight, densely connected neural network for stereo matching that efficiently computes disparity maps without heavy post-processing, demonstrating strong results on multiple benchmarks.
Contribution
The paper presents a novel fully-convolutional densely connected neural network architecture for stereo estimation that is lightweight and avoids complex cost-aggregation methods.
Findings
Performs well on indoor and outdoor scenes
Achieves competitive results on Middlebury, KITTI, ETH3D
Uses simple filtering and consistency checks for refinement
Abstract
We propose a novel lightweight network for stereo estimation. Our network consists of a fully-convolutional densely connected neural network (FC-DCNN) that computes matching costs between rectified image pairs. Our FC-DCNN method learns expressive features and performs some simple but effective post-processing steps. The densely connected layer structure connects the output of each layer to the input of each subsequent layer. This network structure and the fact that we do not use any fully-connected layers or 3D convolutions leads to a very lightweight network. The output of this network is used in order to calculate matching costs and create a cost-volume. Instead of using time and memory-inefficient cost-aggregation methods such as semi-global matching or conditional random fields in order to improve the result, we rely on filtering techniques, namely median filter and guided filter.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDense Connections
