Policy iteration: for want of recursive feasibility, all is not lost
Mathieu Granzotto, Olivier Lindamulage De Silva, Romain Postoyan,, Dragan Nesic, and Zhong-Ping Jiang

TL;DR
This paper analyzes the recursive feasibility and stability of policy iteration (PI) for nonlinear systems, introduces modifications to ensure feasibility, and demonstrates near-optimal control with stability guarantees.
Contribution
It provides novel conditions for recursive robust stability of PI, introduces PI+ to guarantee recursive feasibility, and maintains near-optimality in nonlinear control.
Findings
PI can ensure recursive robust stability under certain conditions.
Modified PI+ guarantees recursive feasibility and stability.
PI+ maintains near-optimality similar to standard PI.
Abstract
This paper investigates recursive feasibility, recursive robust stability and near-optimality properties of policy iteration (PI). For this purpose, we consider deterministic nonlinear discrete-time systems whose inputs are generated by PI for undiscounted cost functions. We first assume that PI is recursively feasible, in the sense that the optimization problems solved at each iteration admit a solution. In this case, we provide novel conditions to establish recursive robust stability properties for a general attractor, meaning that the policies generated at each iteration ensure a robust \KL-stability property with respect to a general state measure. We then derive novel explicit bounds on the mismatch between the (suboptimal) value function returned by PI at each iteration and the optimal one. Afterwards, motivated by a counter-example that shows that PI may fail to be recursively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Mechanical Circulatory Support Devices · Fuel Cells and Related Materials
