Learning Depth with Convolutional Spatial Propagation Network
Xinjing Cheng, Peng Wang, Ruigang Yang

TL;DR
This paper introduces a convolutional spatial propagation network (CSPN) that learns pixel affinities for depth estimation, improving accuracy and speed in depth completion and stereo matching tasks.
Contribution
It proposes a novel CSPN module that can enhance existing depth estimation networks and extends it to 3D for stereo matching, achieving state-of-the-art results.
Findings
30% reduction in depth error on NYU v2 and KITTI datasets
2 to 5 times faster than previous methods
Rank 1 on KITTI Stereo benchmarks
Abstract
Depth prediction is one of the fundamental problems in computer vision. In this paper, we propose a simple yet effective convolutional spatial propagation network (CSPN) to learn the affinity matrix for various depth estimation tasks. Specifically, it is an efficient linear propagation model, in which the propagation is performed with a manner of recurrent convolutional operation, and the affinity among neighboring pixels is learned through a deep convolutional neural network (CNN). We can append this module to any output from a state-of-the-art (SOTA) depth estimation networks to improve their performances. In practice, we further extend CSPN in two aspects: 1) take sparse depth map as additional input, which is useful for the task of depth completion; 2) similar to commonly used 3D convolution operation in CNNs, we propose 3D CSPN to handle features with one additional dimension,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques
Methods3D Convolution · Convolution
