Reduced Space and Faster Convergence in Imperfect-Information Games via   Regret-Based Pruning

Noam Brown; Tuomas Sandholm

arXiv:1609.03234·cs.GT·September 13, 2016·2 cites

Reduced Space and Faster Convergence in Imperfect-Information Games via Regret-Based Pruning

Noam Brown, Tuomas Sandholm

PDF

Open Access

TL;DR

This paper introduces Total RBP, an enhancement to Regret-Based Pruning in CFR algorithms, which reduces space and accelerates convergence in solving large zero-sum imperfect-information games.

Contribution

It presents Total RBP, a novel pruning method that asymptotically eliminates suboptimal actions, leading to faster convergence and significant space savings in game-solving.

Findings

01

Achieves an order of magnitude reduction in space usage.

02

Provably prunes non-essential actions in zero-sum games.

03

Results improve with larger game sizes.

Abstract

Counterfactual Regret Minimization (CFR) is the most popular iterative algorithm for solving zero-sum imperfect-information games. Regret-Based Pruning (RBP) is an improvement that allows poorly-performing actions to be temporarily pruned, thus speeding up CFR. We introduce Total RBP, a new form of RBP that reduces the space requirements of CFR as actions are pruned. We prove that in zero-sum games it asymptotically prunes any action that is not part of a best response to some Nash equilibrium. This leads to provably faster convergence and lower space requirements. Experiments show that Total RBP results in an order of magnitude reduction in space, and the reduction factor increases with game size.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Evolutionary Algorithms and Applications · Reinforcement Learning in Robotics