A measure-valued HJB perspective on Bayesian optimal adaptive control
Alexander M.G. Cox, Sigrid K\"allblad, Chaorui Wang

TL;DR
This paper develops a measure-valued HJB framework for Bayesian adaptive control problems with complex posterior-dependent costs, providing theoretical characterizations and near-optimal control strategies.
Contribution
It introduces a novel measure-valued HJB approach for Bayesian control with non-linear posterior costs, including stability results for measure-valued SDEs.
Findings
Characterization of the value function as a viscosity solution.
Construction of near-optimal piecewise constant controls.
A stability result for measure-valued SDEs.
Abstract
We consider a Bayesian adaptive optimal stochastic control problem where a hidden static signal has a non-separable influence on the drift of a noisy observation. Being allowed to control the specific form of this dependence, we aim at optimising a cost functional depending on the posterior distribution of the hidden signal. Our setup is in sharp contrast to existing work: we include costs that depend on the full posterior distribution in a form that admits a large class of non-linear relationships. Expressing the dynamics of this posterior distribution in the observation filtration, we embed our problem into a genuinely infinite-dimensional stochastic control problem using measure-valued martingales. We address this problem by use of viscosity theory and approximation arguments. Specifically, we show equivalence to a corresponding weak formulation, characterise the optimal value of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Control Systems and Identification
