Continuous-Time Robust Dynamic Programming

Tao Bian; Zhong-Ping Jiang

arXiv:1809.05867·math.OC·September 18, 2018·SIAM J. Control. Optim.

Continuous-Time Robust Dynamic Programming

Tao Bian, Zhong-Ping Jiang

PDF

Open Access

TL;DR

This paper introduces a new robust dynamic programming theory for continuous-time systems, enhancing the analysis of robustness and enabling the development of adaptive optimal control and reinforcement learning methods.

Contribution

The paper develops a novel robust dynamic programming framework for continuous-time systems, distinct from traditional DP, with applications in control and reinforcement learning.

Findings

01

Four illustrative applications in stochastic control and adaptive DP.

02

Three numerical examples from finance and engineering.

03

Potential extensions of the framework are discussed.

Abstract

This paper presents a new theory, known as robust dynamic pro- gramming, for a class of continuous-time dynamical systems. Different from traditional dynamic programming (DP) methods, this new theory serves as a fundamental tool to analyze the robustness of DP algorithms, and in par- ticular, to develop novel adaptive optimal control and reinforcement learning methods. In order to demonstrate the potential of this new framework, four illustrative applications in the fields of stochastic optimal control and adaptive DP are presented. Three numerical examples arising from both finance and engineering industries are also given, along with several possible extensions of the proposed framework.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Optimization and Variational Analysis