Policy Abstraction and Nash Refinement in Tree-Exploiting PSRO
Christine Konicki, Mithun Chakraborty, Michael P. Wellman

TL;DR
This paper advances Tree-exploiting PSRO by introducing scalable game tree representations and refined equilibrium strategies, enabling faster convergence in complex imperfect-information games.
Contribution
It proposes a scalable empirical game tree representation and a generalized backward induction algorithm for refined Nash equilibria in TE-PSRO.
Findings
TE-PSRO converges faster with SPE-based strategy generation.
The approach handles complex imperfect-information games efficiently.
Empirical results show reasonable time and memory usage.
Abstract
Policy Space Response Oracles (PSRO) interleaves empirical game-theoretic analysis with deep reinforcement learning (DRL) to solve games too complex for traditional analytic methods. Tree-exploiting PSRO (TE-PSRO) is a variant of this approach that iteratively builds a coarsened empirical game model in extensive form using data obtained from querying a simulator that represents a detailed description of the game. We make two main methodological advances to TE-PSRO that enhance its applicability to complex games of imperfect information. First, we introduce a scalable representation for the empirical game tree where edges correspond to implicit policies learned through DRL. These policies cover conditions in the underlying game abstracted in the game model, supporting sustainable growth of the tree over epochs. Second, we leverage extensive form in the empirical model by employing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Internet Traffic Analysis and Secure E-voting
