COP-GEN: Latent Diffusion Transformer for Copernicus Earth Observation Data
Miguel Espinosa, Eva Gmelich Meijling, Valerio Marsocci, Elliot J. Crowley, Mikolaj Czerkawski

TL;DR
COP-GEN is a novel multimodal latent diffusion transformer that models the joint distribution of diverse Earth observation data, enabling flexible, uncertainty-aware cross-modal generation and translation without retraining.
Contribution
It introduces a stochastic generative model for EO data that captures cross-modal structure and uncertainty, outperforming deterministic methods and providing a new benchmark for evaluation.
Findings
COP-GEN covers 90% of the real observation manifold on the benchmark.
It maintains strong peak fidelity across modalities.
It enables zero-shot cross-modal translation without retraining.
Abstract
Earth observation applications increasingly rely on data from multiple sensors, including optical, radar, elevation, and land-cover. Relationships between modalities are fundamental for data integration but are inherently non-injective: identical conditioning information can correspond to multiple physically plausible observations, and should be parametrised as conditional distributions. Deterministic models, by contrast, collapse toward conditional means and fail to represent the uncertainty and variability required for tasks such as data completion and cross-sensor translation. We introduce COP-GEN, a multimodal latent diffusion transformer that models the joint distribution of heterogeneous EO modalities at their native spatial resolutions. By parameterising cross-modal mappings as conditional distributions, COP-GEN enables flexible any-to-any conditional generation, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
