Visibility-aware Multi-view Stereo Network

Jingyang Zhang; Yao Yao; Shiwei Li; Zixin Luo; Tian Fang

arXiv:2008.07928·cs.CV·August 20, 2020·52 cites

TL;DR

This paper introduces Vis-MVSNet, a multi-view stereo network that explicitly models pixel-wise occlusion to improve depth estimation accuracy in scenes with severe occlusion.

Contribution

It proposes a novel framework that jointly infers and utilizes pixel-wise occlusion information via matching uncertainty estimation in MVS networks.

Findings

01 Significantly improves depth accuracy in occluded scenes

02 Outperforms existing methods on DTU, BlendedMVS, and Tanks and Temples datasets

03 Effectively suppresses the influence of occluded pixels during cost volume fusion

Abstract

Learning-based multi-view stereo (MVS) methods have demonstrated promising results. However, very few existing networks explicitly take the pixel-wise visibility into consideration, resulting in erroneous cost aggregation from occluded pixels. In this paper, we explicitly infer and integrate the pixel-wise occlusion information in the MVS network via the matching uncertainty estimation. The pair-wise uncertainty map is jointly inferred with the pair-wise depth map, which is further used as weighting guidance during the multi-view cost volume fusion. As such, the adverse influence of occluded pixels is suppressed in the cost fusion. The proposed framework Vis-MVSNet significantly improves depth accuracies in the scenes with severe occlusion. Extensive experiments are performed on DTU, BlendedMVS, and Tanks and Temples datasets to justify the effectiveness of the proposed framework.
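The weighting idea in the abstract can be illustrated with a minimal sketch for a single reference pixel. It assumes each reference-source pair yields a cost vector over depth hypotheses plus one scalar uncertainty, and that the fusion weight takes the form exp(-uncertainty). The function name, the data layout, and that weight form are illustrative assumptions, not taken from the abstract or from the official repository.

```python
import math


def fuse_pairwise_costs(pair_costs, uncertainties):
    """Fuse per-pair matching costs for one reference pixel.

    pair_costs:    list of N lists, each holding D costs (one per depth hypothesis).
    uncertainties: list of N floats; a higher value means a less reliable pair,
                   for example because the pixel is occluded in that source view.

    Assumption (hypothetical): the weight of each pair is exp(-uncertainty).
    """
    if not pair_costs or len(pair_costs) != len(uncertainties):
        raise ValueError("need one uncertainty per pair-wise cost vector")

    num_depths = len(pair_costs[0])
    if any(len(costs) != num_depths for costs in pair_costs):
        raise ValueError("all cost vectors must cover the same depth hypotheses")

    # Confident pairs get weights near 1, uncertain pairs get weights near 0.
    weights = [math.exp(-s) for s in uncertainties]
    total = sum(weights)

    # Normalized weighted average of the costs at every depth hypothesis.
    return [
        sum(w * costs[d] for w, costs in zip(weights, pair_costs)) / total
        for d in range(num_depths)
    ]


# Two source views disagree; the second one is marked as highly uncertain.
fused = fuse_pairwise_costs([[0.0, 1.0], [1.0, 0.0]], [0.0, 20.0])
```

With equal uncertainties the function reduces to a plain average over views, so the uncertainty only changes the result when the pairs differ in reliability. In the example, the fused costs stay close to those of the first, confident pair.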

Code & Models

Repositories

jzhangbs/Vis-MVSNet
PyTorch · Official

Taxonomy

Topics: Advanced Vision and Imaging · Optical measurement and interference techniques · Image Processing Techniques and Applications