Algorithms for zero-sum stochastic games with the risk-sensitive average   criterion

Fang Chen; Xianping Guo; Xin Guo; Junyu Zhang

arXiv:2505.04546·math.OC·May 8, 2025

Algorithms for zero-sum stochastic games with the risk-sensitive average criterion

Fang Chen, Xianping Guo, Xin Guo, Junyu Zhang

PDF

Open Access

TL;DR

This paper develops algorithms to compute approximate values and saddle points in zero-sum risk-sensitive average stochastic games with finite states and actions, supported by convergence proofs and a practical energy management example.

Contribution

It introduces the irreducibility coefficient, establishes its equivalence to irreducibility, and develops iterative algorithms for approximating values and saddle points.

Findings

01

Algorithms converge to $ ext{ε}$-approximations of the value.

02

Finite-step algorithm finds $ ext{ε}$-saddle points.

03

Numerical example demonstrates practical applicability.

Abstract

This paper is an attempt to compute the value and saddle points of zero-sum risk-sensitive average stochastic games. For the average games with finite states and actions, we first introduce the so-called irreducibility coefficient and then establish its equivalence to the irreducibility condition. Using this equivalence,we develop an iteration algorithm to compute $ε$ -approximations of the value (for any given $ε > 0$ ) and show its convergence. Based on $ε$ -approximations of the value and the irreducibility coefficient, we further propose another iteration algorithm, which is proved to obtain $ε$ -saddle points in finite steps. Finally, a numerical example of energy management in smart grids is provided to illustrate our results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Stochastic processes and financial applications · Game Theory and Applications