General limit value in Dynamic Programming

J\'er\^ome Renault (GREMAQ)

arXiv:1301.0451·math.OC·January 4, 2013

General limit value in Dynamic Programming

J\'er\^ome Renault (GREMAQ)

PDF

Open Access

TL;DR

This paper establishes conditions under which a unique limit value exists in dynamic programming problems as decision-makers become infinitely patient, unifying various payoff models and providing a comprehensive theoretical framework.

Contribution

It introduces a general condition for the uniform convergence of value functions in dynamic programming as patience tends to infinity, identifying a unique limit value independent of evaluation sequences.

Findings

01

Uniform convergence occurs iff the sequence of value functions is totally bounded.

02

A unique limit value function $v^*$ exists, independent of the evaluation sequence.

03

The results apply to discounted, average, and stochastic transition models.

Abstract

We consider a dynamic programming problem with arbitrary state space and bounded rewards. Is it possible to define in an unique way a limit value for the problem, where the "patience" of the decision-maker tends to infinity ? We consider, for each evaluation $θ$ (a probability distribution over positive integers) the value function $v_{θ}$ of the problem where the weight of any stage $t$ is given by $θ_{t}$ , and we investigate the uniform convergence of a sequence $(v_{θ^{k}})_{k}$ when the "impatience" of the evaluations vanishes, in the sense that $\sum_{t} ∣ θ_{t}^{k} - θ_{t + 1}^{k} ∣ \to_{k \to \infty} 0$ . We prove that this uniform convergence happens if and only if the metric space $v_{θ^{k}}, k \geq 1$ is totally bounded. Moreover there exists a particular function $v^{*}$ , independent of the particular chosen sequence $(θ^{k})_{k}$ , such that any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEconomic theories and models · Risk and Portfolio Optimization · Supply Chain and Inventory Management