A measure-valued HJB perspective on Bayesian optimal adaptive control

Alexander M.G. Cox; Sigrid K\"allblad; Chaorui Wang

arXiv:2502.12957·math.OC·December 22, 2025

A measure-valued HJB perspective on Bayesian optimal adaptive control

Alexander M.G. Cox, Sigrid K\"allblad, Chaorui Wang

PDF

Open Access

TL;DR

This paper develops a measure-valued HJB framework for Bayesian adaptive control problems with complex posterior-dependent costs, providing theoretical characterizations and near-optimal control strategies.

Contribution

It introduces a novel measure-valued HJB approach for Bayesian control with non-linear posterior costs, including stability results for measure-valued SDEs.

Findings

01

Characterization of the value function as a viscosity solution.

02

Construction of near-optimal piecewise constant controls.

03

A stability result for measure-valued SDEs.

Abstract

We consider a Bayesian adaptive optimal stochastic control problem where a hidden static signal has a non-separable influence on the drift of a noisy observation. Being allowed to control the specific form of this dependence, we aim at optimising a cost functional depending on the posterior distribution of the hidden signal. Our setup is in sharp contrast to existing work: we include costs that depend on the full posterior distribution in a form that admits a large class of non-linear relationships. Expressing the dynamics of this posterior distribution in the observation filtration, we embed our problem into a genuinely infinite-dimensional stochastic control problem using measure-valued martingales. We address this problem by use of viscosity theory and approximation arguments. Specifically, we show equivalence to a corresponding weak formulation, characterise the optimal value of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Control Systems and Identification