Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation
Hongbin Xu, Zhipeng Zhou, Yu Qiao, Wenxiong Kang, Qiuxia Wu

TL;DR
This paper introduces a self-supervised multi-view stereo framework that leverages semantic co-segmentation and data-augmentation to improve 3D reconstruction accuracy without relying on color consistency across views.
Contribution
It proposes a novel self-supervised approach using semantic co-segmentation and data-augmentation to enhance multi-view stereo reconstruction performance.
Findings
Achieves state-of-the-art results on DTU dataset among unsupervised methods.
Competitively matches supervised methods in reconstruction quality.
Demonstrates strong generalization on Tanks&Temples dataset.
Abstract
Recent studies have witnessed that self-supervised methods based on view synthesis obtain clear progress on multi-view stereo (MVS). However, existing methods rely on the assumption that the corresponding points among different views share the same color, which may not always be true in practice. This may lead to unreliable self-supervised signal and harm the final reconstruction performance. To address the issue, we propose a framework integrated with more reliable supervision guided by semantic co-segmentation and data-augmentation. Specially, we excavate mutual semantic from multi-view images to guide the semantic consistency. And we devise effective data-augmentation mechanism which ensures the transformation robustness by treating the prediction of regular samples as pseudo ground truth to regularize the prediction of augmented samples. Experimental results on DTU dataset show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Optical Coherence Tomography Applications
