Time-inconsistent Risk-sensitive Equilibrium for Countable-stated Markov   Decision Processes

Hongwei Mei

arXiv:1909.06863·math.OC·October 22, 2020

Time-inconsistent Risk-sensitive Equilibrium for Countable-stated Markov Decision Processes

Hongwei Mei

PDF

Open Access

TL;DR

This paper investigates time-inconsistent risk-sensitive control in countable-state Markov decision processes, establishing equilibrium strategies and their convergence as risk sensitivity diminishes.

Contribution

It introduces the concept of time-inconsistent equilibrium strategies for risk-sensitive MDPs and proves their existence and convergence in the limit case.

Findings

01

Existence of time-inconsistent equilibrium strategies.

02

Convergence of $ ext{e}$-equilibriums as $ ext{e} o 0^+$.

03

Validation of step-optimality in the control problem.

Abstract

This paper is devoted to solving a time-inconsistent risk-sensitive control problem with parameter $\e$ and its limit case ( $\e \to 0^{+}$ ) for countable-stated Markov decision processes (MDPs for short). Since the cost functional is time-inconsistent, it is impossible to find a global optimal strategy for both cases. Instead, for each case, we will prove the existence of time-inconstant equilibrium strategies which verify the so-called step-optimality. Moreover, we prove the convergence of $\e$ -equilibriums and the corresponding value functions as $\e \to 0^{+}$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Risk and Portfolio Optimization · Reinforcement Learning in Robotics