Probabilistic Spatial Interpolation of Sparse Data using Diffusion Models
Valerie Tsao, Nathaniel W. Chaney, Manolis Veveakis

TL;DR
This paper introduces a diffusion model-based framework for reconstructing complete temperature fields from extremely sparse observational data, addressing data gaps in climate modeling and forecasting.
Contribution
It presents a novel conditional data imputation method that reconstructs full temperature fields from as little as 1% observational coverage using diffusion models.
Findings
Achieves high accuracy in temperature field reconstruction with minimal data.
Effective across various observational densities, from swath data to isolated sensors.
Validated on Southern Great Plains summer data from 2018-2020.
Abstract
The large underlying assumption of climate models today relies on the basis of a "confident" initial condition, a reasonably plausible snapshot of the Earth for which all future predictions depend on. However, given the inherently chaotic nature of our system, this assumption is complicated by sensitive dependence, where small uncertainties in initial conditions can lead to exponentially diverging outcomes over time. This challenge is particularly salient at global spatial scales and over centennial timescales, where data gaps are not just common but expected. The source of uncertainty is two-fold: (1) sparse, noisy observations from satellites and ground stations, and (2) internal variability stemming from the simplifying approximations within the models themselves. In practice, data assimilation methods are used to reconcile this missing information by conditioning model states on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoil Geostatistics and Mapping
MethodsDiffusion
