A Convex Optimization Approach to Dynamic Programming in Continuous   State and Action Spaces

Insoon Yang

arXiv:1810.03847·math.OC·October 23, 2020

A Convex Optimization Approach to Dynamic Programming in Continuous State and Action Spaces

Insoon Yang

PDF

TL;DR

This paper introduces a convex optimization-based method for solving dynamic programming problems in continuous spaces, enabling efficient approximation of the value function and control policies with convergence guarantees.

Contribution

It presents a novel convex optimization framework for dynamic programming in continuous spaces, offering convergence guarantees and suboptimality bounds for control-affine systems.

Findings

01

The method approximates the optimal value function with uniform convergence.

02

It provides a control policy with performance converging to the optimum as grid resolution improves.

03

The approach extends to nonlinear control-affine systems with provable suboptimality bounds.

Abstract

In this paper, a convex optimization-based method is proposed for numerically solving dynamic programs in continuous state and action spaces. The key idea is to approximate the output of the Bellman operator at a particular state by the optimal value of a convex program. The approximate Bellman operator has a computational advantage because it involves a convex optimization problem in the case of control-affine systems and convex costs. Using this feature, we propose a simple dynamic programming algorithm to evaluate the approximate value function at pre-specified grid points by solving convex optimization problems in each iteration. We show that the proposed method approximates the optimal value function with a uniform convergence property in the case of convex optimal value functions. We also propose an interpolation-free design method for a control policy, of which performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.