Constraining Depth Map Geometry for Multi-View Stereo: A Dual-Depth   Approach with Saddle-shaped Depth Cells

Xinyi Ye; Weiyue Zhao; Tianqi Liu; Zihao Huang; Zhiguo Cao; Xin Li

arXiv:2307.09160·cs.CV·July 19, 2023

Constraining Depth Map Geometry for Multi-View Stereo: A Dual-Depth Approach with Saddle-shaped Depth Cells

Xinyi Ye, Weiyue Zhao, Tianqi Liu, Zihao Huang, Zhiguo Cao, Xin Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel dual-depth approach with saddle-shaped depth cells for multi-view stereo, significantly improving depth geometry modeling and achieving top benchmark performance.

Contribution

It proposes a new saddle-shaped depth geometry and a dual-depth prediction framework, enhancing multi-view stereo accuracy by constraining depth oscillations around the ground truth.

Findings

01

High rank on DTU benchmark

02

Top performance on Tanks and Temples scenes

03

Demonstrates the importance of depth geometry in MVS

Abstract

Learning-based multi-view stereo (MVS) methods deal with predicting accurate depth maps to achieve an accurate and complete 3D representation. Despite the excellent performance, existing methods ignore the fact that a suitable depth geometry is also critical in MVS. In this paper, we demonstrate that different depth geometries have significant performance gaps, even using the same depth prediction error. Therefore, we introduce an ideal depth geometry composed of Saddle-Shaped Cells, whose predicted depth map oscillates upward and downward around the ground-truth surface, rather than maintaining a continuous and smooth depth plane. To achieve it, we develop a coarse-to-fine framework called Dual-MVSNet (DMVSNet), which can produce an oscillating depth plane. Technically, we predict two depth values for each pixel (Dual-Depth), and propose a novel loss function and a checkerboard-shaped…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dive128/dmvsnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Image Enhancement Techniques