A Uniform-grid Discretization Algorithm for Stochastic Control with Risk   Constraints

Yin-Lam Chow; Marco Pavone

arXiv:1501.02024·math.OC·January 12, 2015

A Uniform-grid Discretization Algorithm for Stochastic Control with Risk Constraints

Yin-Lam Chow, Marco Pavone

PDF

Open Access

TL;DR

This paper introduces a discretization algorithm for risk-constrained dynamic programming in stochastic control, addressing computational challenges by discretizing continuous spaces and providing error bounds for the approximation.

Contribution

It proposes a finite space approximation method for risk-constrained dynamic programming, with proven linear error bounds related to discretization step size.

Findings

01

Discretization effectively approximates the Bellman operator in risk-constrained problems.

02

Error bounds of the approximation are linearly related to discretization step size.

03

Implementation details and potential modifications are discussed.

Abstract

In this paper, we present a discretization algorithm for finite horizon risk constrained dynamic programming algorithm in [Chow_Pavone_13]. Although in a theoretical standpoint, Bellman's recursion provides a systematic way to find optimal value functions and generate optimal history dependent policies, there is a serious computational issue. Even if the state space and action space of this constrained stochastic optimal control problem are finite, the spaces of risk threshold and the feasible risk update are closed bounded subset of real numbers. This prohibits any direct applications of unconstrained finite state iterative methods in dynamic programming found in [Bertsekas_05]. In order to approximate Bellman's operator derived in [Chow_Pavone_13], we discretize the continuous action spaces and formulate a finite space approximation for the exact dynamic programming algorithm. We will…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Economic theories and models · Reinforcement Learning in Robotics