Adapting to game trees in zero-sum imperfect information games

C\^ome Fiegel; Pierre M\'enard; Tadashi Kozuno; R\'emi Munos; Vianney; Perchet; Michal Valko

arXiv:2212.12567·stat.ML·February 16, 2023·1 cites

Adapting to game trees in zero-sum imperfect information games

C\^ome Fiegel, Pierre M\'enard, Tadashi Kozuno, R\'emi Munos, Vianney, Perchet, Michal Valko

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

This paper investigates learning near-optimal strategies in zero-sum imperfect information games through self-play, establishing lower bounds and proposing two algorithms that adapt to game structure and observations.

Contribution

It provides a problem-independent lower bound on sample complexity and introduces two FTRL algorithms, one requiring prior game structure knowledge and the other adapting online.

Findings

01

Lower bound of (H(A_X+B_Y))/(. )^2 on realizations needed

02

Balanced FTRL matches the lower bound but needs game structure knowledge

03

Adaptive FTRL achieves near-optimal sample complexity without prior structure knowledge

Abstract

Imperfect information games (IIG) are games in which each player only partially observes the current game state. We study how to learn $ϵ$ -optimal strategies in a zero-sum IIG through self-play with trajectory feedback. We give a problem-independent lower bound $O (H (A_{X} + B_{Y}) / ϵ^{2})$ on the required number of realizations to learn these strategies with high probability, where $H$ is the length of the game, $A_{X}$ and $B_{Y}$ are the total number of actions for the two players. We also propose two Follow the Regularized leader (FTRL) algorithms for this setting: Balanced FTRL which matches this lower bound, but requires the knowledge of the information set structure beforehand to define the regularization; and Adaptive FTRL which needs $O (H^{2} (A_{X} + B_{Y}) / ϵ^{2})$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anon17893/iig-tree-adaptation
noneOfficial

Datasets

misovalko/my-research-papers
dataset· 21 dl
21 dl

Videos

Adapting to game trees in zero-sum imperfect information games· slideslive

Taxonomy

TopicsArtificial Intelligence in Games · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics