Stochastic Games with Minimally Bounded Action Costs
David Mguni

TL;DR
This paper studies a stochastic game where players face positive action costs, characterizes the equilibrium, and introduces a Q-learning method for unknown settings, with extensions to budget constraints.
Contribution
It proves the existence of a unique value and characterizes Markovian saddle point equilibria in stochastic games with minimally bounded costs, introducing a convergent Q-learning variant.
Findings
Unique value for the game established
Markovian saddle point equilibrium characterized
Q-learning variant converges almost surely
Abstract
In many multi-player interactions, players incur strictly positive costs each time they execute actions e.g. 'menu costs' or transaction costs in financial systems. Since acting at each available opportunity would accumulate prohibitively large costs, the resulting decision problem is one in which players must make strategic decisions about when to execute actions in addition to their choice of action. This paper analyses a discrete-time stochastic game (SG) in which players face minimally bounded positive costs for each action and influence the system using impulse controls. We prove SGs of two-sided impulse control have a unique value and characterise the saddle point equilibrium in which the players execute actions at strategically chosen times in accordance with Markovian strategies. We prove the game respects a dynamic programming principle and that the Markov perfect equilibrium…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Game Theory and Applications · Economic theories and models
