SDL-MVS: View Space and Depth Deformable Learning Paradigm for   Multi-View Stereo Reconstruction in Remote Sensing

Yong-Qiang Mao; Hanbo Bi; Liangyu Xu; Kaiqiang Chen; Zhirui Wang; Xian; Sun; Kun Fu

arXiv:2405.17140·cs.CV·May 28, 2024

SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing

Yong-Qiang Mao, Hanbo Bi, Liangyu Xu, Kaiqiang Chen, Zhirui Wang, Xian, Sun, Kun Fu

PDF

Open Access

TL;DR

SDL-MVS introduces a deformable learning paradigm for multi-view stereo in remote sensing, effectively handling occlusion and brightness variations to improve depth estimation accuracy in large-scale urban 3D reconstruction.

Contribution

It proposes view space and depth deformable learning methods, including PSS and DHD, to enhance feature interaction and depth modeling in multi-view stereo reconstruction.

Findings

01

Achieves state-of-the-art performance on LuoJia-MVS and WHU datasets.

02

Attains an MAE error of 0.086 in depth estimation.

03

Reaches 98.9% accuracy for <0.6m depth error.

Abstract

Research on multi-view stereo based on remote sensing images has promoted the development of large-scale urban 3D reconstruction. However, remote sensing multi-view image data suffers from the problems of occlusion and uneven brightness between views during acquisition, which leads to the problem of blurred details in depth estimation. To solve the above problem, we re-examine the deformable learning method in the Multi-View Stereo task and propose a novel paradigm based on view Space and Depth deformable Learning (SDL-MVS), aiming to learn deformable interactions of features in different view spaces and deformably model the depth ranges and intervals to enable high accurate depth estimation. Specifically, to solve the problem of view noise caused by occlusion and uneven brightness, we propose a Progressive Space deformable Sampling (PSS) mechanism, which performs deformable learning of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSatellite Image Processing and Photogrammetry · Advanced Vision and Imaging · Remote Sensing and Land Use

MethodsMasked autoencoder