CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation
Chenying Liu, Conrad Albrecht, Yi Wang, Xiao Xiang Zhu

TL;DR
This paper introduces CromSS, a novel cross-modal pretraining method that leverages noisy labels and multi-modal data to improve remote sensing image segmentation, demonstrating significant benefits over traditional approaches.
Contribution
CromSS is a new weakly supervised pretraining strategy that uses cross-modal consistency and noise mitigation to enhance feature learning from noisy labels in remote sensing.
Findings
NoLDO-S12 dataset assembled for pretraining and transfer learning.
CromSS improves segmentation performance on multiple datasets.
Effective use of noisy labels enhances feature learning.
Abstract
We explore the potential of large-scale noisily labeled data to enhance feature learning by pretraining semantic segmentation models within a multi-modal framework for geospatial applications. We propose a novel Cross-modal Sample Selection (CromSS) method, a weakly supervised pretraining strategy designed to improve feature representations through cross-modal consistency and noise mitigation techniques. Unlike conventional pretraining approaches, CromSS exploits massive amounts of noisy and easy-to-come-by labels for improved feature learning beneficial to semantic segmentation tasks. We investigate middle and late fusion strategies to optimize the multi-modal pretraining architecture design. We also introduce a cross-modal sample selection module to mitigate the adverse effects of label noise, which employs a cross-modal entangling strategy to refine the estimated confidence masks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Medical Image Segmentation Techniques
