Bayesian Opponent Exploitation in Imperfect-Information Games

Sam Ganzfried; Qingyun Sun

arXiv:1603.03491 · cs.GT · June 29, 2018

TL;DR

This paper introduces the first exact algorithm for Bayesian opponent exploitation in a natural class of imperfect-information games. It outperforms prior approximation methods and runs quickly in practice.

Contribution

It presents the first exact algorithm for Bayesian opponent exploitation in certain imperfect-information games, advancing beyond previous approximate approaches.

Findings

01. Algorithm runs quickly in practice

02. Outperforms prior approximation methods

03. Effective for a natural class of imperfect-information games

Abstract

Two fundamental problems in computational game theory are computing a Nash equilibrium and learning to exploit opponents given observations of their play (opponent exploitation). The latter is perhaps even more important than the former: Nash equilibrium does not have a compelling theoretical justification in game classes other than two-player zero-sum, and for all games one can potentially do better by exploiting perceived weaknesses of the opponent than by following a static equilibrium strategy throughout the match. The natural setting for opponent exploitation is the Bayesian setting, where we have a prior model that is integrated with observations to create a posterior opponent model that we respond to. The most natural, and a well-studied, prior distribution is the Dirichlet distribution. An exact polynomial-time algorithm is known for best-responding to the posterior distribution…
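The Bayesian update described in the abstract can be illustrated with a minimal sketch. It assumes a normal-form game in which the opponent's actions are observed directly as multinomial counts. This is the simple setting, not the paper's algorithm for imperfect-information games. The function names, the rock-paper-scissors payoffs, and the observation counts are all hypothetical. Under these assumptions the posterior is again a Dirichlet distribution, and because expected payoff is linear in the opponent's strategy, best-responding to the posterior reduces to best-responding to its mean.

```python
def dirichlet_posterior(prior_alpha, action_counts):
    """Dirichlet prior plus multinomial counts gives Dirichlet posterior parameters."""
    return [a + c for a, c in zip(prior_alpha, action_counts)]


def posterior_mean(alpha):
    """Mean opponent strategy under a Dirichlet with parameters alpha."""
    total = sum(alpha)
    return [a / total for a in alpha]


def best_response(payoffs, opponent_strategy):
    """payoffs[i][j] is our payoff for our action i against opponent action j.

    Returns the index of our best action and the expected payoff of every action.
    """
    expected = [
        sum(p * q for p, q in zip(row, opponent_strategy)) for row in payoffs
    ]
    best = max(range(len(expected)), key=expected.__getitem__)
    return best, expected


# Hypothetical example: rock-paper-scissors, actions ordered (rock, paper, scissors).
payoffs = [
    [0.0, -1.0, 1.0],   # we play rock
    [1.0, 0.0, -1.0],   # we play paper
    [-1.0, 1.0, 0.0],   # we play scissors
]
prior_alpha = [1.0, 1.0, 1.0]   # assumed uniform Dirichlet prior
counts = [6, 2, 2]              # assumed observations of the opponent's play

alpha = dirichlet_posterior(prior_alpha, counts)
mean = posterior_mean(alpha)
action, values = best_response(payoffs, mean)
print(alpha, mean, action, values)
```

With these assumed counts the opponent is modeled as rock-heavy, so the sketch selects paper (index 1). In imperfect-information games the opponent's private information is not observed directly, so this simple count update does not apply as written.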
