Markov games with frequent actions and incomplete information

Pierre Cardaliaguet (CEREMADE); Catherine Rainer (LM); Dinah Rosenberg; (GREGH); Nicolas Vieille (GREGH)

arXiv:1307.3365·math.OC·July 15, 2013

Markov games with frequent actions and incomplete information

Pierre Cardaliaguet (CEREMADE), Catherine Rainer (LM), Dinah Rosenberg, (GREGH), Nicolas Vieille (GREGH)

PDF

TL;DR

This paper analyzes a two-player zero-sum stochastic game with incomplete information, where players can act more frequently, and characterizes the limit value as actions become continuous, using advanced mathematical tools.

Contribution

It introduces a framework for Markov games with frequent actions and incomplete information, establishing the existence and characterization of the limit value as actions become continuous.

Findings

01

Existence of a limit value as the time between actions vanishes

02

Characterization of the limit value via an auxiliary optimization problem

03

Solution of a Hamilton-Jacobi equation for the limit value

Abstract

We study a two-player, zero-sum, stochastic game with incomplete information on one side in which the players are allowed to play more and more frequently. The informed player observes the realization of a Markov chain on which the payoffs depend, while the non-informed player only observes his opponent's actions. We show the existence of a limit value as the time span between two consecutive stages vanishes; this value is characterized through an auxiliary optimization problem and as the solution of an Hamilton-Jacobi equation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.