Partial-Information Q-Learning for General Two-Player Stochastic Games

Negash Medhin; Andrew Papanicolaou; Marwen Zrida

arXiv:2302.10830·cs.GT·February 22, 2023·1 cites

Partial-Information Q-Learning for General Two-Player Stochastic Games

Negash Medhin, Andrew Papanicolaou, Marwen Zrida

PDF

Open Access

TL;DR

This paper introduces a partial-information Nash Q-learning algorithm for two-player stochastic games, proving its convergence to Nash equilibria without requiring players to know opponents' strategies, simplifying implementation.

Contribution

It presents the first convergence proof for partial-information Q-learning in general 2-player stochastic games, avoiding complex equilibrium computations at each step.

Findings

01

Partial-information Q-learning converges to Nash equilibria.

02

Performance comparable to full-information Q-learning and fictitious play.

03

Simplifies implementation by not requiring Nash equilibrium calculations each iteration.

Abstract

In this article we analyze a partial-information Nash Q-learning algorithm for a general 2-player stochastic game. Partial information refers to the setting where a player does not know the strategy or the actions taken by the opposing player. We prove convergence of this partially informed algorithm for general 2-player games with finitely many states and actions, and we confirm that the limiting strategy is in fact a full-information Nash equilibrium. In implementation, partial information offers simplicity because it avoids computation of Nash equilibria at every time step. In contrast, full-information Q-learning uses the Lemke-Howson algorithm to compute Nash equilibria at every time step, which can be an effective approach but requires several assumptions to prove convergence and may have runtime error if Lemke-Howson encounters degeneracy. In simulations, the partial information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExperimental Behavioral Economics Studies · Auction Theory and Applications · Game Theory and Applications