Reinforcement Learning, Optimal Control, and Bayesian Filtering in Data Assimilation
Abed Hammoud

TL;DR
This paper unifies Bayesian filtering, smoothing, variational data assimilation, and control within a single variational framework, clarifying their relationships and conditions for optimality.
Contribution
It introduces a finite-horizon variational formulation that explicitly connects various data assimilation and control methods, providing new insights into their theoretical foundations.
Findings
Identifies the evidence as a global infimum in the variational hierarchy.
Shows strong- and weak-constraint 4D-Var are MAP estimators under Gaussian assumptions.
Demonstrates the ensemble Kalman filter as a Gaussian approximation in the linear-Gaussian limit.
Abstract
We give a finite-horizon variational formulation that places Bayesian filtering and smoothing, variational data assimilation, KL-regularized control, and Kalman-type methods inside one mathematically explicit hierarchy. For a discrete-time hidden Markov model and any admissible one-step candidate law , We prove , and, for any admissible path law , . These identities determine the evidence as the global infimum and make the analysis and smoothing posteriors the unique minimizers whenever those posterior laws belong to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
