Pixel-Wise Multimodal Contrastive Learning for Remote Sensing Images
Leandro Stival, Ricardo da Silva Torres, Helio Pedrini

TL;DR
This paper introduces a novel pixel-wise multimodal contrastive learning framework for remote sensing images that leverages 2D representations from satellite image time series to improve feature extraction and downstream task performance.
Contribution
It proposes a new multimodal self-supervised approach using recurrence plots from pixel-wise vegetation indices, outperforming existing methods in Earth observation tasks.
Findings
Enhanced feature extraction from SITS using 2D representations.
Contrastive learning improves representation quality for remote sensing data.
Outperforms state-of-the-art methods on multiple Earth observation benchmarks.
Abstract
Satellites continuously generate massive volumes of data, particularly for Earth observation, including satellite image time series (SITS). However, most deep learning models are designed to process either entire images or complete time series sequences to extract meaningful features for downstream tasks. In this study, we propose a novel multimodal approach that leverages pixel-wise two-dimensional (2D) representations to encode visual property variations from SITS more effectively. Specifically, we generate recurrence plots from pixel-based vegetation index time series (NDVI, EVI, and SAVI) as an alternative to using raw pixel values, creating more informative representations. Additionally, we introduce PIxel-wise Multimodal Contrastive (PIMC), a new multimodal self-supervision approach that produces effective encoders based on two-dimensional pixel time series representations and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing in Agriculture · Remote-Sensing Image Classification · Time Series Analysis and Forecasting
