Reinforcement Learning, Optimal Control, and Bayesian Filtering in Data Assimilation

Abed Hammoud

arXiv:2604.12158·math.DS·April 15, 2026

Reinforcement Learning, Optimal Control, and Bayesian Filtering in Data Assimilation

Abed Hammoud

PDF

TL;DR

This paper unifies Bayesian filtering, smoothing, variational data assimilation, and control within a single variational framework, clarifying their relationships and conditions for optimality.

Contribution

It introduces a finite-horizon variational formulation that explicitly connects various data assimilation and control methods, providing new insights into their theoretical foundations.

Findings

01

Identifies the evidence as a global infimum in the variational hierarchy.

02

Shows strong- and weak-constraint 4D-Var are MAP estimators under Gaussian assumptions.

03

Demonstrates the ensemble Kalman filter as a Gaussian approximation in the linear-Gaussian limit.

Abstract

We give a finite-horizon variational formulation that places Bayesian filtering and smoothing, variational data assimilation, KL-regularized control, and Kalman-type methods inside one mathematically explicit hierarchy. For a discrete-time hidden Markov model and any admissible one-step candidate law $q_{t}$ , We prove $J_{t} (q_{t}) = E_{q_{t}} [- lo g p (y_{t} ∣ X_{t})] + KL (q_{t} ∥ p_{t}^{f}) = KL (q_{t} ∥ p_{t}^{a}) - lo g p (y_{t} ∣ y_{0 : t - 1})$ , and, for any admissible path law $q$ , $J_{path} (q) = E_{q} [- \sum_{t = 0}^{T} lo g p (y_{t} ∣ X_{t})] + KL (q ∥ p (x_{0 : T})) = KL (q ∥ p (x_{0 : T} ∣ y_{0 : T})) - lo g p (y_{0 : T})$ . These identities determine the evidence as the global infimum and make the analysis and smoothing posteriors the unique minimizers whenever those posterior laws belong to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.