REOBench: Benchmarking Robustness of Earth Observation Foundation Models

Xiang Li; Yong Tao; Siyuan Zhang; Siwei Liu; Zhitong Xiong; Chunbo Luo; Lu Liu; Mykola Pechenizkiy; Xiao Xiang Zhu; Tianjin Huang

arXiv:2505.16793·cs.CV·October 24, 2025

REOBench: Benchmarking Robustness of Earth Observation Foundation Models

Xiang Li, Yong Tao, Siyuan Zhang, Siwei Liu, Zhitong Xiong, Chunbo Luo, Lu Liu, Mykola Pechenizkiy, Xiao Xiang Zhu, Tianjin Huang

PDF

Open Access 1 Repo 1 Datasets

TL;DR

REOBench is a comprehensive benchmark that evaluates the robustness of Earth observation foundation models against various real-world image corruptions across multiple tasks, revealing significant vulnerabilities and guiding future improvements.

Contribution

This work introduces REOBench, the first benchmark specifically designed to assess the robustness of Earth observation foundation models under diverse corruptions.

Findings

01

Models degrade significantly under input corruptions.

02

Robustness varies across tasks, architectures, and corruption types.

03

Vision-language models exhibit enhanced robustness.

Abstract

Earth observation foundation models have shown strong generalization across multiple Earth observation tasks, but their robustness under real-world perturbations remains underexplored. To bridge this gap, we introduce REOBench, the first comprehensive benchmark for evaluating the robustness of Earth observation foundation models across six tasks and twelve types of image corruptions, including both appearance-based and geometric perturbations. To ensure realistic and fine-grained evaluation, our benchmark focuses on high-resolution optical remote sensing images, which are widely used in critical applications such as urban planning and disaster response. We conduct a systematic evaluation of a broad range of models trained using masked image modeling, contrastive learning, and vision-language pre-training paradigms. Our results reveal that (1) existing Earth observation foundation models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lx709/reobench
pytorchOfficial

Datasets

xiang709/REOBench
dataset· 11 dl
11 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning