Time-inconsistent Risk-sensitive Equilibrium for Countable-stated Markov Decision Processes
Hongwei Mei

TL;DR
This paper investigates time-inconsistent risk-sensitive control in countable-state Markov decision processes, establishing equilibrium strategies and their convergence as risk sensitivity diminishes.
Contribution
It introduces the concept of time-inconsistent equilibrium strategies for risk-sensitive MDPs and proves their existence and convergence in the limit case.
Findings
Existence of time-inconsistent equilibrium strategies.
Convergence of $ ext{e}$-equilibriums as $ ext{e} o 0^+$.
Validation of step-optimality in the control problem.
Abstract
This paper is devoted to solving a time-inconsistent risk-sensitive control problem with parameter and its limit case () for countable-stated Markov decision processes (MDPs for short). Since the cost functional is time-inconsistent, it is impossible to find a global optimal strategy for both cases. Instead, for each case, we will prove the existence of time-inconstant equilibrium strategies which verify the so-called step-optimality. Moreover, we prove the convergence of -equilibriums and the corresponding value functions as .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Risk and Portfolio Optimization · Reinforcement Learning in Robotics
