Safe Opponent Exploitation For Epsilon Equilibrium Strategies
Linus Jeary, Paolo Turrini

TL;DR
This paper introduces prime-safe opponent exploitation, a method that enhances safe exploitation strategies in large imperfect information games by redefining game value to ensure practical guarantees and improved empirical performance.
Contribution
It extends safe opponent exploitation to large domains by redefining game value as a worst-case payoff, enabling scalable and practical exploitation strategies.
Findings
Empirical improvements over existing safe exploitation algorithms.
Effective in large imperfect information games like poker.
Provides practical guarantees with weaker epsilon strategies.
Abstract
In safe opponent exploitation players hope to exploit their opponents' potentially sub-optimal strategies while guaranteeing at least the value of the game in expectation for themselves. Safe opponent exploitation algorithms have been successfully applied to small instances of two-player zero-sum imperfect information games, where Nash equilibrium strategies are typically known in advance. Current methods available to compute these strategies are however not scalable to desirable large domains of imperfect information such as No-Limit Texas Hold 'em (NLHE) poker, where successful agents rely on game abstractions in order to compute an equilibrium strategy approximation. This paper will extend the concept of safe opponent exploitation by introducing prime-safe opponent exploitation, in which we redefine the value of the game of a player to be the worst-case payoff their strategy could be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Game Theory and Applications
