DDR-Net: Learning Multi-Stage Multi-View Stereo With Dynamic Depth Range
Puyuan Yi, Shengkun Tang, Jian Yao

TL;DR
DDR-Net introduces a dynamic depth range estimation module for multi-view stereo, enabling adaptive depth hypotheses that improve high-resolution depth map accuracy over fixed-range methods.
Contribution
The paper proposes DDR-Net, which dynamically estimates depth ranges using a range estimation module, enhancing multi-view stereo accuracy and efficiency compared to fixed-range approaches.
Findings
Achieves superior performance on DTU benchmark
Obtains comparable results on Tanks and Temples benchmark
Uses a novel loss strategy with learned dynamic depth ranges
Abstract
To obtain high-resolution depth maps, some previous learning-based multi-view stereo methods build a cost volume pyramid in a coarse-to-fine manner. These approaches leverage fixed depth range hypotheses to construct cascaded plane sweep volumes. However, it is inappropriate to set identical range hypotheses for each pixel since the uncertainties of previous per-pixel depth predictions are spatially varying. Distinct from these approaches, we propose a Dynamic Depth Range Network (DDR-Net) to determine the depth range hypotheses dynamically by applying a range estimation module (REM) to learn the uncertainties of range hypotheses in the former stages. Specifically, in our DDR-Net, we first build an initial depth map at the coarsest resolution of an image across the entire depth range. Then the range estimation module (REM) leverages the probability distribution information of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Robotics and Sensor-Based Localization
