LO-SDA: Latent Optimization for Score-based Atmospheric Data Assimilation

Jing-An Sun; Hang Fan; Junchao Gong; Ben Fei; Kun Chen; Fenghua Ling; Wenlong Zhang; Wanghan Xu; Li Yan; Pierre Gentine; Lei Bai

arXiv:2510.22562·physics.ao-ph·October 28, 2025

LO-SDA: Latent Optimization for Score-based Atmospheric Data Assimilation

Jing-An Sun, Hang Fan, Junchao Gong, Ben Fei, Kun Chen, Fenghua Ling, Wenlong Zhang, Wanghan Xu, Li Yan, Pierre Gentine, Lei Bai

PDF

3 Reviews

TL;DR

LO-SDA introduces a novel generative data assimilation method using latent space diffusion models, outperforming traditional Gaussian prior-based methods in atmospheric prediction tasks.

Contribution

The paper presents a new generative DA approach with latent optimization and diffusion models, removing Gaussian assumptions and improving accuracy in atmospheric data assimilation.

Findings

01

LO-SDA outperforms traditional DA methods in idealized experiments.

02

Diffusion-based DA surpasses score-based methods in accuracy.

03

First demonstration of diffusion models outperforming traditional DA in high-dimensional atmospheric systems.

Abstract

Data assimilation (DA) plays a pivotal role in numerical weather prediction by systematically integrating sparse observations with model forecasts to estimate optimal atmospheric initial condition for forthcoming forecasts. Traditional Bayesian DA methods adopt a Gaussian background prior as a practical compromise for the curse of dimensionality in atmospheric systems, that simplifies the nonlinear nature of atmospheric dynamics and can result in biased estimates. To address this limitation, we propose a novel generative DA method, LO-SDA. First, a variational autoencoder is trained to learn compact latent representations that disentangle complex atmospheric correlations. Within this latent space, a background-conditioned diffusion model is employed to directly learn the conditional distribution from data, thereby generalizing and removing assumptions in the Gaussian prior in…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 6Confidence 3

Strengths

Clear motivation. The work targets the Gaussian-prior limitation of classical DA and builds a principled bridge between score-based and variational formulations. Methodological coherence. The latent optimisation scheme is well justified and neatly integrated into the diffusion framework. Algorithm 1 clearly contrasts it with DPS guidance. Presentation quality. Figures are clear and the connection to variational DA is articulated.

Weaknesses

- Incremental novelty. The paper’s central idea—enforcing observation consistency within score-based DA—is conceptually close to existing approaches such as score-based DA, DiffDA, and Manshausen et al., 2025. LO-SDA differs mainly by performing optimization in latent space rather than directly in model space and by framing the repeated guidance as “latent optimization.” This is a useful but evolutionary step rather than a paradigm shift. - Overstated claims. The manuscript repeatedly suggests

Reviewer 02Rating 6Confidence 2

Strengths

Establishes a theoretical bridge between variational DA and score-based diffusion modeling.

Weaknesses

- Computational cost for latent optimization process is expensive - The work mainly integrates known elements (VAE compression + score-based DA + optimization) rather than proposing fundamentally new generative principles.

Reviewer 03Rating 4Confidence 4

Strengths

Originality: The paper proposes a creative combination of deep generative modeling with data assimilation. Using a conditional diffusion model in a VAE latent space to perform Bayesian inference is a novel approach in the context of atmospheric data assimilation. The introduction of a latent optimization step to enforce observation consistency is an innovative twist that addresses a known issue in score-based filters (i.e., ensuring the analysis state actually fits the sparse observations). Qua

Weaknesses

Baseline Comparisons: A notable weakness is the absence of comparison with some established DA methods like the Ensemble Kalman Filter (e.g., LETKF). LETKF and similar ensemble Kalman techniques are widely used operationally and are strong benchmarks for assimilation performance. The paper claims to outperform “traditional approaches,” but this is only demonstrated against 3DVar/L3DVar. Without experiments against an ensemble Kalman filter, it is unclear how LO-SDA stands relative to the true st

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.