Neural Plasticity-Inspired Multimodal Foundation Model for Earth Observation
Zhitong Xiong, Yi Wang, Fahong Zhang, Adam J. Stewart, Jo\"elle Hanna, Damian Borth, Ioannis Papoutsis, Bertrand Le Saux, Gustau Camps-Valls, Xiao Xiang Zhu

TL;DR
The paper introduces DOFA, a neural plasticity-inspired multimodal foundation model for Earth Observation that flexibly processes diverse satellite sensor data, achieving state-of-the-art results with efficient training.
Contribution
It presents a novel dynamic hypernetwork approach enabling a single model to handle multiple EO sensor modalities, improving generalization and efficiency.
Findings
State-of-the-art performance on multiple EO tasks
Effective generalization to unseen sensor modalities
Reduced computational resources with hybrid continual pretraining
Abstract
Earth observation (EO) in open-world settings presents a unique challenge: different applications rely on diverse sensor modalities, each with varying ground sampling distances, spectral ranges, and numbers of spectral bands. However, existing EO foundation models are typically tailored to specific sensor types, making them inflexible when generalizing across the heterogeneous landscape of EO data. To address this, we propose the Dynamic One-For-All (DOFA) model, a unified, multimodal foundation framework designed for diverse vision tasks in EO. Inspired by neural plasticity, DOFA utilizes a wavelength-conditioned dynamic hypernetwork to process inputs from five distinct satellite sensors flexibly. By continually pretraining on five EO modalities, DOFA achieves state-of-the-art performance across multiple downstream tasks and generalizes well to unseen modalities. Enhanced with hybrid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Neural Networks and Applications · Seismology and Earthquake Studies
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Byte Pair Encoding · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Multi-Head Attention · Softmax · Dropout
