SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing
Yong-Qiang Mao, Hanbo Bi, Liangyu Xu, Kaiqiang Chen, Zhirui Wang, Xian, Sun, Kun Fu

TL;DR
SDL-MVS introduces a deformable learning paradigm for multi-view stereo in remote sensing, effectively handling occlusion and brightness variations to improve depth estimation accuracy in large-scale urban 3D reconstruction.
Contribution
It proposes view space and depth deformable learning methods, including PSS and DHD, to enhance feature interaction and depth modeling in multi-view stereo reconstruction.
Findings
Achieves state-of-the-art performance on LuoJia-MVS and WHU datasets.
Attains an MAE error of 0.086 in depth estimation.
Reaches 98.9% accuracy for <0.6m depth error.
Abstract
Research on multi-view stereo based on remote sensing images has promoted the development of large-scale urban 3D reconstruction. However, remote sensing multi-view image data suffers from the problems of occlusion and uneven brightness between views during acquisition, which leads to the problem of blurred details in depth estimation. To solve the above problem, we re-examine the deformable learning method in the Multi-View Stereo task and propose a novel paradigm based on view Space and Depth deformable Learning (SDL-MVS), aiming to learn deformable interactions of features in different view spaces and deformably model the depth ranges and intervals to enable high accurate depth estimation. Specifically, to solve the problem of view noise caused by occlusion and uneven brightness, we propose a Progressive Space deformable Sampling (PSS) mechanism, which performs deformable learning of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSatellite Image Processing and Photogrammetry · Advanced Vision and Imaging · Remote Sensing and Land Use
MethodsMasked autoencoder
