Zero-sum risk-sensitive continuous-time stochastic games with unbounded   payoff and transition rates and Borel spaces

Junyu Zhang; Xianping Guo; Li Xia

arXiv:2103.04057·math.OC·March 9, 2021·1 cites

Zero-sum risk-sensitive continuous-time stochastic games with unbounded payoff and transition rates and Borel spaces

Junyu Zhang, Xianping Guo, Li Xia

PDF

Open Access

TL;DR

This paper investigates a complex continuous-time zero-sum stochastic game with unbounded rates and rewards, establishing existence of solutions, Nash equilibria, and developing a convergent value iteration algorithm.

Contribution

It introduces a novel approach to handle unbounded rates and rewards in risk-sensitive stochastic games, proving existence and uniqueness of solutions and equilibria.

Findings

01

Existence of a solution to the Shapley equation under unbounded conditions

02

Proof of Nash equilibrium existence and value characterization

03

Development and convergence proof of a value iteration algorithm

Abstract

We study a finite-horizon two-person zero-sum risk-sensitive stochastic game for continuous-time Markov chains and Borel state and action spaces, in which payoff rates, transition rates and terminal reward functions are allowed to be unbounded from below and from above and the policies can be history-dependent. Under suitable conditions, we establish the existence of a solution to the corresponding Shapley equation (SE) by an approximation technique. Then, by the SE and the extension of the Dynkin's formula, we prove the existence of a Nash equilibrium and verify that the value of the stochastic game is the unique solution to the SE. Moreover, we develop a value iteration-type algorithm for approaching to the value of the stochastic game. The convergence of the algorithm is proved by a special contraction operator in our risk-sensitive stochastic game. Finally, we demonstrate our main…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEconomic theories and models · Stochastic processes and financial applications · Risk and Portfolio Optimization