MMEarth-Bench: Global Model Adaptation via Multimodal Test-Time Training
Lucia Gordon, Serge Belongie, Christian Igel, Nico Lang

TL;DR
This paper introduces MMEarth-Bench, a comprehensive multimodal geospatial benchmark dataset, and proposes a test-time training method to improve model robustness and geographic generalization in Earth observation tasks.
Contribution
It provides a new global multimodal benchmark dataset and a model-agnostic test-time training method to enhance model adaptation and robustness in geospatial applications.
Findings
Pretraining improves robustness in limited data settings.
Geographic generalization remains challenging.
Test-time training with multimodal reconstruction enhances performance.
Abstract
Recent research in geospatial machine learning has demonstrated that models pretrained with self-supervised learning on Earth observation data can perform well on downstream tasks with limited training data. However, most of the existing geospatial benchmark datasets have few data modalities and poor global representation, limiting the ability to evaluate multimodal pretrained models at global scales. To fill this gap, we introduce MMEarth-Bench, a collection of five new multimodal environmental tasks with 12 modalities, globally distributed data, and both in- and out-of-distribution test splits. We benchmark a diverse set of pretrained models and find that while (multimodal) pretraining tends to improve model robustness in limited data settings, geographic generalization abilities remain poor. In order to facilitate model adaptation to new downstream tasks and geographic domains, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification · Gaussian Processes and Bayesian Inference
