Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota, Ryan D'Orazio, Chun Kai Ling, David J. Wu, J. Zico, Kolter, Noam Brown

TL;DR
This paper introduces a novel approach to simplify two-player zero-sum games by using regularized equilibria, enabling decision-time planning without the issues of previous methods, and approximating Nash equilibria.
Contribution
It demonstrates that certain regularized equilibria can be computed as perfect-information problems, overcoming non-correspondence issues in existing approaches.
Findings
Regularized equilibria approximate Nash equilibria arbitrarily closely.
The new framework simplifies decision-time planning in two-player zero-sum games.
Existing algorithms' limitations are addressed by the proposed regularization approach.
Abstract
In their seminal work, Nayyar et al. (2013) showed that imperfect information can be abstracted away from common-payoff games by having players publicly announce their policies as they play. This insight underpins sound solvers and decision-time planning algorithms for common-payoff games. Unfortunately, a naive application of the same insight to two-player zero-sum games fails because Nash equilibria of the game with public policy announcements may not correspond to Nash equilibria of the original game. As a consequence, existing sound decision-time planning algorithms require complicated additional mechanisms that have unappealing properties. The main contribution of this work is showing that certain regularized equilibria do not possess the aforementioned non-correspondence problem -- thus, computing them can be treated as perfect-information problems. Because these regularized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExperimental Behavioral Economics Studies · Game Theory and Applications · Economic theories and models
