Computing Ex Ante Coordinated Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games
Youzhi Zhang, Bo An, Jakub \v{C}ern\'y

TL;DR
This paper introduces a new algorithm for efficiently computing Team-Maxmin Equilibria with Coordination in large zero-sum multiplayer extensive-form games, which has applications in card games and real-world scenarios like forest protection.
Contribution
The paper presents a hybrid strategy representation, a column-generation algorithm with guaranteed convergence, and an exact multilinear term representation, significantly improving computational efficiency.
Findings
Algorithm is several orders of magnitude faster than previous methods.
Successfully computes equilibria in larger, complex games.
Demonstrates practical applications in card games and real-world scenarios.
Abstract
Computational game theory has many applications in the modern world in both adversarial situations and the optimization of social good. While there exist many algorithms for computing solutions in two-player interactions, finding optimal strategies in multiplayer interactions efficiently remains an open challenge. This paper focuses on computing the multiplayer Team-Maxmin Equilibrium with Coordination device (TMECor) in zero-sum extensive-form games. TMECor models scenarios when a team of players coordinates ex ante against an adversary. Such situations can be found in card games (e.g., in Bridge and Poker), when a team works together to beat a target player but communication is prohibited; and also in real world, e.g., in forest-protection operations, when coordinated groups have limited contact during interdicting illegal loggers. The existing algorithms struggle to find a TMECor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEvolutionary Game Theory and Cooperation · Opinion Dynamics and Social Influence · Reinforcement Learning in Robotics
