3DVNet: Multi-View Depth Prediction and Volumetric Refinement

Alexander Rich; Noah Stier; Pradeep Sen; Tobias H\"ollerer

arXiv:2112.00202·cs.CV·December 2, 2021

3DVNet: Multi-View Depth Prediction and Volumetric Refinement

Alexander Rich, Noah Stier, Pradeep Sen, Tobias H\"ollerer

PDF

1 Repo

TL;DR

3DVNet introduces a multi-view stereo depth prediction method that combines volumetric 3D CNNs and iterative refinement to achieve superior accuracy in 3D scene reconstruction.

Contribution

The paper proposes a novel 3D CNN-based multi-view stereo approach that iteratively refines depth maps using scene-level priors and feature-augmented point clouds.

Findings

01

Outperforms state-of-the-art in depth prediction accuracy

02

Achieves superior 3D reconstruction quality

03

Generalizes well across different datasets

Abstract

We present 3DVNet, a novel multi-view stereo (MVS) depth-prediction method that combines the advantages of previous depth-based and volumetric MVS approaches. Our key idea is the use of a 3D scene-modeling network that iteratively updates a set of coarse depth predictions, resulting in highly accurate predictions which agree on the underlying scene geometry. Unlike existing depth-prediction techniques, our method uses a volumetric 3D convolutional neural network (CNN) that operates in world space on all depth maps jointly. The network can therefore learn meaningful scene-level priors. Furthermore, unlike existing volumetric MVS techniques, our 3D CNN operates on a feature-augmented point cloud, allowing for effective aggregation of multi-view information and flexible iterative refinement of depth maps. Experimental results show our method exceeds state-of-the-art accuracy in both depth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alexrich021/3dvnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methods3 Dimensional Convolutional Neural Network