Learning Inverse Depth Regression for Multi-View Stereo with Correlation Cost Volume
Qingshan Xu, Wenbing Tao

TL;DR
This paper introduces a novel lightweight cost volume construction using group-wise correlation and recasts multi-view depth inference as an inverse depth regression task, achieving state-of-the-art results in scalability and accuracy.
Contribution
It proposes a new correlation-based cost volume and a cascade 3D U-Net for regularization, enabling efficient and accurate large-scale multi-view stereo depth estimation.
Findings
Achieves state-of-the-art results on DTU and Tanks and Temples datasets.
Reduces memory consumption and computational burden.
Enables sub-pixel depth estimation for large-scale scenes.
Abstract
Deep learning has shown to be effective for depth inference in multi-view stereo (MVS). However, the scalability and accuracy still remain an open problem in this domain. This can be attributed to the memory-consuming cost volume representation and inappropriate depth inference. Inspired by the group-wise correlation in stereo matching, we propose an average group-wise correlation similarity measure to construct a lightweight cost volume. This can not only reduce the memory consumption but also reduce the computational burden in the cost volume filtering. Based on our effective cost volume representation, we propose a cascade 3D U-Net module to regularize the cost volume to further boost the performance. Unlike the previous methods that treat multi-view depth inference as a depth regression problem or an inverse depth classification problem, we recast multi-view depth inference as an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Optical measurement and interference techniques
MethodsConcatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Convolution · U-Net
