Continuity of Filters for Discrete-Time Control Problems Defined by Explicit Equations

Eugene A. Feinberg; Sayaka Ishizawa; Pavlo O. Kasyanov; David N. Kraemer

arXiv:2311.12184·math.OC·February 5, 2025·SIAM J. Control. Optim.·2 cites

Open Access

TL;DR

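This paper establishes sufficient conditions on transition and observation functions that ensure weak continuity of filters in discrete-time stochastic control problems. Under mild conditions on cost functions, weak continuity implies the existence of optimal policies, the validity of optimality equations, and convergence of value iterations.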

Contribution

It provides sufficient conditions for weak continuity of filters, that is, of the transition probabilities of the MDPs whose states are posterior state distributions, aiding the analysis of optimal control in partially observable systems.

Findings

01. Weak continuity of filters is guaranteed under specified conditions.

02. Continuity in total variation of transition probabilities is established.

03. Applications demonstrate the practical relevance of the theoretical results.

Abstract

Discrete-time control systems whose dynamics and observations are described by stochastic equations are common in engineering, operations research, health care, and economics. For example, stochastic filtering problems are usually defined via stochastic equations. These problems can be reduced to Markov decision processes (MDPs) whose states are posterior state distributions, and transition probabilities for such MDPs are sometimes called filters. This paper investigates sufficient conditions on transition and observation functions for the original problems to guarantee weak continuity of the filter. Under mild conditions on cost functions, weak continuity implies the existence of optimal policies minimizing the expected total costs, the validity of optimality equations, and convergence of value iterations to optimal values. This paper uses recent results on weak continuity of filters…
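As a rough illustration of the reduction described in the abstract, the sketch below computes one posterior update for a toy finite model defined by explicit equations. Everything in it is an assumption made for illustration and is not taken from the paper: the state set `STATES`, the noise distributions `XI` and `ETA`, the transition function `F`, the observation function `G`, and the helper names. The paper itself treats far more general state and observation spaces.

```python
# Hypothetical toy model (not from the paper): finite states and finite noise.
STATES = (0, 1, 2)
XI = {-1: 0.25, 0: 0.5, 1: 0.25}   # assumed distribution of the state noise
ETA = {0: 0.8, 1: 0.2}             # assumed distribution of the observation noise


def F(x, a, xi):
    """Assumed explicit transition equation x' = F(x, a, xi), clipped to STATES."""
    return min(max(x + a + xi, 0), 2)


def G(a, x_next, eta):
    """Assumed explicit observation equation y = G(a, x', eta): a noisy parity reading."""
    return (x_next + eta) % 2


def transition_prob(x, a, x_next):
    """P(x' | x, a), induced by F and the state-noise distribution."""
    return sum(p for xi, p in XI.items() if F(x, a, xi) == x_next)


def observation_prob(a, x_next, y):
    """P(y | a, x'), induced by G and the observation-noise distribution."""
    return sum(p for eta, p in ETA.items() if G(a, x_next, eta) == y)


def filter_step(belief, a, y):
    """Return the posterior over STATES after taking action a and observing y."""
    unnormalized = [
        observation_prob(a, x_next, y)
        * sum(belief[x] * transition_prob(x, a, x_next) for x in STATES)
        for x_next in STATES
    ]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under this belief")
    return [u / total for u in unnormalized]


posterior = filter_step([1.0, 0.0, 0.0], a=0, y=0)
```

In this finite setting the posterior is an ordinary probability vector, and the map from `(belief, a)` to the distribution of the next posterior plays the role of the filter; the continuity questions studied in the paper concern that map on general spaces.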

Taxonomy

Topics: Risk and Portfolio Optimization