How to Embed Matters: Evaluation of EO Embedding Design Choices
Luis Gilch, Isabelle Wittmann, Maximilian Nitsche, Johannes Jakubik, Arne Ewald, Thomas Brunschwiler

TL;DR
This paper systematically analyzes how design choices in Earth observation embeddings affect downstream performance, emphasizing the importance of architecture, training, and aggregation strategies for scalable, effective workflows.
Contribution
It provides a comprehensive evaluation of embedding design factors in GeoFMs, offering practical insights for optimizing EO workflows with compact, reusable representations.
Findings
Transformer backbones with mean pooling are effective default embeddings.
Intermediate ResNet layers can outperform final layers in some tasks.
Combining embeddings from different objectives enhances robustness.
Abstract
Earth observation (EO) missions produce petabytes of multispectral imagery, increasingly analyzed using large Geospatial Foundation Models (GeoFMs). Alongside end-to-end adaptation, workflows make growing use of intermediate representations as task-agnostic embeddings, enabling models to compute representations once and reuse them across downstream tasks. Consequently, when GeoFMs act as feature extractors, decisions about how representations are obtained, aggregated, and combined affect downstream performance and pipeline scalability. Understanding these trade-offs is essential for scalable embedding-based EO workflows, where compact embeddings can replace raw data while remaining broadly useful. We present a systematic analysis of embedding design in GeoFM-based EO workflows. Leveraging NeuCo-Bench, we study how backbone architecture, pretraining strategy, representation depth,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
