SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM optimization

Zhenlong Yuan; Jiakai Cao; Zhaoxin Li; Hao Jiang; Zhaoqi Wang

arXiv:2401.06385·cs.CV·November 26, 2025·2 cites

SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM optimization

Zhenlong Yuan, Jiakai Cao, Zhaoxin Li, Hao Jiang, Zhaoqi Wang

PDF

Open Access 1 Video

TL;DR

SD-MVS introduces a segmentation-driven approach using SAM and EM optimization to improve 3D reconstruction quality, especially in textureless areas, achieving state-of-the-art results efficiently.

Contribution

The paper presents a novel segmentation-driven multi-view stereo method that integrates SAM for semantic constraints and EM for parameter optimization, enhancing reconstruction completeness and robustness.

Findings

01

Achieves state-of-the-art results on ETH3D and Tanks and Temples datasets.

02

Effectively handles textureless regions in 3D reconstruction.

03

Reduces time consumption compared to existing methods.

Abstract

In this paper, we introduce Segmentation-Driven Deformation Multi-View Stereo (SD-MVS), a method that can effectively tackle challenges in 3D reconstruction of textureless areas. We are the first to adopt the Segment Anything Model (SAM) to distinguish semantic instances in scenes and further leverage these constraints for pixelwise patch deformation on both matching cost and propagation. Concurrently, we propose a unique refinement strategy that combines spherical coordinates and gradient descent on normals and pixelwise search interval on depths, significantly improving the completeness of reconstructed 3D model. Furthermore, we adopt the Expectation-Maximization (EM) algorithm to alternately optimize the aggregate matching cost and hyperparameters, effectively mitigating the problem of parameters being excessively dependent on empirical tuning. Evaluations on the ETH3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM Optimization· underline

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques