Affine Monotonic and Risk-Sensitive Models in Dynamic Programming

Dimitri Bertsekas

arXiv:1608.01393·math.OC·November 29, 2017·IEEE Trans. Autom. Control.

Affine Monotonic and Risk-Sensitive Models in Dynamic Programming

Dimitri Bertsekas

PDF

TL;DR

This paper studies a broad class of infinite horizon control models with affine dynamic programming equations, establishing conditions for solution uniqueness and algorithm validity, especially when the affine mapping exhibits semicontractive behavior.

Contribution

It introduces a unified framework for affine monotonic models, proving solution uniqueness and algorithm convergence under semicontractive conditions, extending classical Markov decision process results.

Findings

01

Unique solution of Bellman's equation under certain assumptions

02

Validity of value and policy iteration for contractive policies

03

Weaker results without semicontractive assumptions

Abstract

In this paper we consider a broad class of infinite horizon discrete-time optimal control models that involve a nonnegative cost function and an affine mapping in their dynamic programming equation. They include as special cases classical models such as stochastic undiscounted nonnegative cost problems, stochastic multiplicative cost problems, and risk-sensitive problems with exponential cost. We focus on the case where the state space is finite and the control space has some compactness properties. We assume that the affine mapping has a semicontractive character, whereby for some policies it is a contraction, while for others it is not. In one line of analysis, we impose assumptions that guarantee that the latter policies cannot be optimal. Under these assumptions, we prove strong results that resemble those for discounted Markovian decision problems, such as the uniqueness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.