A Pontryagin Maximum Principle on the Belief Space for Continuous-Time Optimal Control with Discrete Observations

Christian Bayer; Saifeddine Ben naamia; Erik von Schwerin; Raul Tempone

arXiv:2512.24916 · math.OC · January 1, 2026

TL;DR

This paper develops a Pontryagin maximum principle on the belief space for continuous-time stochastic control with discrete observations, linking optimality conditions to nonlinear filtering equations and providing a particle-based numerical solution.

Contribution

It introduces a novel maximum principle on the belief space for hybrid continuous-discrete control problems, connecting it to filtering equations and proposing a practical particle filtering algorithm.

Findings

01 The maximum principle provides necessary conditions for optimal control under partial observations.

02 The relationship between the adjoint process and the value functional gradient is established.

03 Numerical experiments demonstrate the effectiveness of the particle-based control scheme.

Abstract

We study a continuous-time stochastic optimal control problem under partial observations that are available only at discrete time instants. This hybrid setting, with continuous dynamics and intermittent noisy measurements, arises in applications ranging from robotic exploration and target tracking to epidemic control. We formulate the problem on the space of beliefs (information states), treating the controller's posterior distribution of the state as the state variable for decision making. On this belief space we derive a Pontryagin maximum principle that provides necessary conditions for optimality. The analysis carefully tracks both the continuous evolution of the state between observation times and the Bayesian jump updates of the belief at observation instants. A key insight is a relationship between the adjoint process in our maximum principle and the gradient of the value…
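The hybrid structure described in the abstract (continuous evolution between observation times, Bayesian jump updates at observation instants) can be illustrated with a generic bootstrap particle filter. The sketch below is not the paper's algorithm: the scalar dynamics dX = (-X + u) dt + sigma dW, the Gaussian observation model y = X + noise, the constant control, and all function names and parameter values are assumptions chosen only for illustration.

```python
import numpy as np

def propagate(particles, control, dt, n_steps, sigma, rng):
    """Euler-Maruyama steps of the assumed SDE dX = (-X + u) dt + sigma dW."""
    for _ in range(n_steps):
        noise = rng.standard_normal(particles.shape)
        particles = particles + (-particles + control) * dt + sigma * np.sqrt(dt) * noise
    return particles

def bayes_update(particles, weights, y, obs_std):
    """Bayesian jump: reweight by the Gaussian likelihood of y = X + noise."""
    log_w = np.log(weights) - 0.5 * ((y - particles) / obs_std) ** 2
    log_w -= log_w.max()  # shift for numerical stability before exponentiating
    new_w = np.exp(log_w)
    return new_w / new_w.sum()

def resample(particles, weights, rng):
    """Multinomial resampling; returns equally weighted particles."""
    n = len(particles)
    idx = rng.choice(n, size=n, p=weights)
    return particles[idx], np.full(n, 1.0 / n)

def run_demo(n_particles=2000, n_obs=5, seed=0):
    """Track a hidden scalar state; return the belief mean after each observation."""
    rng = np.random.default_rng(seed)
    sigma, obs_std, dt, n_steps, control = 0.3, 0.2, 0.01, 50, 1.0  # illustrative values
    particles = rng.normal(0.0, 1.0, size=n_particles)  # prior belief
    weights = np.full(n_particles, 1.0 / n_particles)
    truth = np.array([0.5])  # hidden state, unknown to the filter
    means = []
    for _ in range(n_obs):
        # Continuous evolution between observation times.
        truth = propagate(truth, control, dt, n_steps, sigma, rng)
        particles = propagate(particles, control, dt, n_steps, sigma, rng)
        # Discrete noisy measurement and Bayesian jump of the belief.
        y = truth[0] + obs_std * rng.standard_normal()
        weights = bayes_update(particles, weights, y, obs_std)
        means.append(float(np.sum(weights * particles)))
        particles, weights = resample(particles, weights, rng)
    return means

if __name__ == "__main__":
    print(run_demo())
```

Here the weighted particle cloud plays the role of the belief: `propagate` moves it between observation instants and `bayes_update` applies the jump at each measurement. Resampling after every update is a simplification; an effective-sample-size threshold is a common alternative.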

Taxonomy

Topics: Distributed Control Multi-Agent Systems · Adaptive Dynamic Programming Control · Target Tracking and Data Fusion in Sensor Networks