Fine-tuning deep learning models for stereo matching using results from semi-global matching
Hessah Albanwan, Rongjun Qin

TL;DR
This paper proposes a finetuning approach for deep learning stereo matching models using disparity maps from semi-global matching, enhancing their transferability to diverse satellite imagery without requiring ground-truth data.
Contribution
The paper introduces a simple finetuning scheme leveraging SGM-derived disparity maps to improve DL models' transferability across varied remote sensing datasets.
Findings
Improved transferability of DL stereo matching models across different regions.
Demonstrated effectiveness on 20 diverse satellite study-sites.
Visual and numerical performance improvements observed.
Abstract
Deep learning (DL) methods are widely investigated for stereo image matching tasks due to their reported high accuracies. However, their transferability/generalization capabilities are limited by the instances seen in the training data. With satellite images covering large-scale areas with variances in locations, content, land covers, and spatial patterns, we expect their performances to be impacted. Increasing the number and diversity of training data is always an option, but with the ground-truth disparity being limited in remote sensing due to its high cost, it is almost impossible to obtain the ground-truth for all locations. Knowing that classical stereo matching methods such as Census-based semi-global-matching (SGM) are widely adopted to process different types of stereo data, we therefore, propose a finetuning method that takes advantage of disparity maps derived from SGM on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Satellite Image Processing and Photogrammetry
