A Convex Optimization Approach to Dynamic Programming in Continuous State and Action Spaces
Insoon Yang

TL;DR
This paper introduces a convex optimization-based method for solving dynamic programming problems in continuous spaces, enabling efficient approximation of the value function and control policies with convergence guarantees.
Contribution
It presents a novel convex optimization framework for dynamic programming in continuous spaces, offering convergence guarantees and suboptimality bounds for control-affine systems.
Findings
The method approximates the optimal value function with uniform convergence.
It provides a control policy with performance converging to the optimum as grid resolution improves.
The approach extends to nonlinear control-affine systems with provable suboptimality bounds.
Abstract
In this paper, a convex optimization-based method is proposed for numerically solving dynamic programs in continuous state and action spaces. The key idea is to approximate the output of the Bellman operator at a particular state by the optimal value of a convex program. The approximate Bellman operator has a computational advantage because it involves a convex optimization problem in the case of control-affine systems and convex costs. Using this feature, we propose a simple dynamic programming algorithm to evaluate the approximate value function at pre-specified grid points by solving convex optimization problems in each iteration. We show that the proposed method approximates the optimal value function with a uniform convergence property in the case of convex optimal value functions. We also propose an interpolation-free design method for a control policy, of which performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
