Multi-Label Guided Soft Contrastive Learning for Efficient Earth Observation Pretraining
Yi Wang, Conrad M Albrecht, Xiao Xiang Zhu

TL;DR
This paper introduces a soft contrastive learning framework utilizing land-cover data and cross-domain pretraining to efficiently develop Earth observation foundation models with improved downstream task performance.
Contribution
It proposes a novel soft contrastive learning method that leverages multi-label land-cover supervision and explores cross-domain continual pretraining for multispectral and SAR imagery.
Findings
Achieves state-of-the-art results on multiple downstream tasks.
Improves pretraining efficiency and effectiveness with additional resources.
Produces models outperforming existing SOTA in 10 out of 11 tasks.
Abstract
Self-supervised pretraining on large-scale satellite data has raised great interest in building Earth observation (EO) foundation models. However, many important resources beyond pure satellite imagery, such as land-cover-land-use products that provide free global semantic information, as well as vision foundation models that hold strong knowledge of the natural world, are not widely studied. In this work, we show these free additional resources not only help resolve common contrastive learning bottlenecks, but also significantly boost the efficiency and effectiveness of EO pretraining. Specifically, we first propose soft contrastive learning that optimizes cross-scene soft similarity based on land-cover-generated multi-label supervision, naturally solving the issue of multiple positive samples and too strict positive matching in complex scenes. Second, we revisit and explore…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Assessment
MethodsContrastive Learning
