Stable Optimal Control and Semicontractive Dynamic Programming
Dimitri P. Bertsekas

TL;DR
This paper introduces a new framework for analyzing stable policies in infinite horizon deterministic optimal control, extending classical results to cases where policies may not stabilize the system, and characterizes solutions to Bellman's equation.
Contribution
It proposes a unifying notion of stable feedback policies based on cost perturbation, and characterizes the solutions and convergence properties of modified dynamic programming algorithms.
Findings
$ ilde J$ and $ar J$ are solutions to Bellman's equation.
The smallest and largest solutions are $ ilde J$ and $J^+$.
Modified value and policy iteration algorithms have specific convergence regions.
Abstract
We consider discrete-time infinite horizon deterministic optimal control problems with nonnegative cost per stage, and a destination that is cost-free and absorbing. The classical linear-quadratic regulator problem is a special case. Our assumptions are very general, and allow the possibility that the optimal policy may not be stabilizing the system, e.g., may not reach the destination either asymptotically or in a finite number of steps. We introduce a new unifying notion of stable feedback policy, based on perturbation of the cost per stage, which in addition to implying convergence of the generated states to the destination, quantifies the speed of convergence. We consider the properties of two distinct cost functions: , the overall optimal, and , the restricted optimal over just the stable policies. Different classes of stable policies (with different speeds of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
