Blending 3D Geometry and Machine Learning for Multi-View Stereopsis
Vibhas Vats, Md. Alimoor Reza, David Crandall, Soon-heung Jung

TL;DR
This paper presents GC MVSNet++, a novel multi-view stereo method that enforces geometric consistency during learning, leading to faster training and state-of-the-art results on multiple datasets.
Contribution
It introduces the first method to incorporate multi-view, multi-scale geometric consistency enforcement during the training process of MVS networks.
Findings
Halves training iterations compared to previous methods.
Achieves state-of-the-art on DTU and BlendedMVS datasets.
Secures second place on Tanks and Temples benchmark.
Abstract
Traditional multi-view stereo (MVS) methods primarily depend on photometric and geometric consistency constraints. In contrast, modern learning-based algorithms often rely on the plane sweep algorithm to infer 3D geometry, applying explicit geometric consistency (GC) checks only as a post-processing step, with no impact on the learning process itself. In this work, we introduce GC MVSNet plus plus, a novel approach that actively enforces geometric consistency of reference view depth maps across multiple source views (multi view) and at various scales (multi scale) during the learning phase (see Fig. 1). This integrated GC check significantly accelerates the learning process by directly penalizing geometrically inconsistent pixels, effectively halving the number of training iterations compared to other MVS methods. Furthermore, we introduce a densely connected cost regularization network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Satellite Image Processing and Photogrammetry
