Solving Imperfect-Information Games via Discounted Regret Minimization

Noam Brown; Tuomas Sandholm

arXiv:1809.04040·cs.GT·February 22, 2019

TL;DR

This paper introduces novel variants of counterfactual regret minimization (CFR) that incorporate discounting, reweighting, and optimistic regret matching, significantly improving performance in solving large imperfect-information games.

Contribution

The paper presents new CFR variants that outperform CFR+, some of which are compatible with modern pruning and sampling techniques, advancing the state of the art in imperfect-information game solving.

Findings

1. One new variant outperforms CFR+, the prior state-of-the-art algorithm, in every game tested.
2. Some variants are compatible with game pruning techniques.
3. One variant supports sampling in the game tree.

Abstract

Counterfactual regret minimization (CFR) is a family of iterative algorithms that are the most popular and, in practice, fastest approach to approximately solving large imperfect-information games. In this paper we introduce novel CFR variants that 1) discount regrets from earlier iterations in various ways (in some cases differently for positive and negative regrets), 2) reweight iterations in various ways to obtain the output strategies, 3) use a non-standard regret minimizer and/or 4) leverage "optimistic regret matching". They lead to dramatically improved performance in many settings. For one, we introduce a variant that outperforms CFR+, the prior state-of-the-art algorithm, in every game tested, including large-scale realistic settings. CFR+ is a formidable benchmark: no other algorithm has been able to outperform it. Finally, we show that, unlike CFR+, many of the important new…
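To make the first two ideas in the abstract concrete (discounting earlier regrets, with different treatment of positive and negative regrets, and reweighting iterations when forming the output strategy), here is a minimal sketch on a small two-player zero-sum matrix game rather than a full game tree. The abstract does not state any formulas, so everything specific here is an assumption for illustration: the parameter names `alpha`, `beta`, and `gamma`, their default values, the discount factors `t**alpha / (t**alpha + 1)` and `t**beta / (t**beta + 1)`, the averaging weight `(t / (t + 1))**gamma`, and the payoff matrix `EXAMPLE_PAYOFF`. All function names are hypothetical.

```python
def regret_matching(regrets):
    """Play each action in proportion to its positive regret (uniform if none)."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total <= 0.0:
        return [1.0 / len(regrets)] * len(regrets)
    return [p / total for p in positive]


def discount_regrets(regrets, t, alpha=1.5, beta=0.0):
    """Shrink accumulated regrets after iteration t.

    Assumed schedule: positive regrets are scaled by t^alpha / (t^alpha + 1),
    negative regrets by t^beta / (t^beta + 1).
    """
    pos_w = t ** alpha / (t ** alpha + 1.0)
    neg_w = t ** beta / (t ** beta + 1.0)
    return [r * (pos_w if r > 0 else neg_w) for r in regrets]


def solve_matrix_game(payoff, iterations=5000, alpha=1.5, beta=0.0, gamma=2.0):
    """Self-play with discounted regret matching; returns average strategies.

    payoff[i][j] is the row player's payoff; the column player receives the negative.
    """
    n_rows, n_cols = len(payoff), len(payoff[0])
    row_regret, col_regret = [0.0] * n_rows, [0.0] * n_cols
    row_sum, col_sum = [0.0] * n_rows, [0.0] * n_cols
    for t in range(1, iterations + 1):
        x = regret_matching(row_regret)
        y = regret_matching(col_regret)
        # Expected utility of each pure action against the opponent's current strategy.
        row_util = [sum(payoff[i][j] * y[j] for j in range(n_cols)) for i in range(n_rows)]
        col_util = [-sum(payoff[i][j] * x[i] for i in range(n_rows)) for j in range(n_cols)]
        row_ev = sum(p * u for p, u in zip(x, row_util))
        col_ev = sum(p * u for p, u in zip(y, col_util))
        # Accumulate instantaneous regret, then discount the running totals.
        row_regret = discount_regrets(
            [r + u - row_ev for r, u in zip(row_regret, row_util)], t, alpha, beta)
        col_regret = discount_regrets(
            [r + u - col_ev for r, u in zip(col_regret, col_util)], t, alpha, beta)
        # Reweight the average: later iterations end up counting more.
        w = (t / (t + 1.0)) ** gamma
        row_sum = [(s + p) * w for s, p in zip(row_sum, x)]
        col_sum = [(s + p) * w for s, p in zip(col_sum, y)]
    return ([s / sum(row_sum) for s in row_sum], [s / sum(col_sum) for s in col_sum])


def exploitability(payoff, x, y):
    """Sum of both players' best-response gains; zero at an exact equilibrium."""
    n_rows, n_cols = len(payoff), len(payoff[0])
    best_row = max(sum(payoff[i][j] * y[j] for j in range(n_cols)) for i in range(n_rows))
    best_col = min(sum(payoff[i][j] * x[i] for i in range(n_rows)) for j in range(n_cols))
    return best_row - best_col


# Hypothetical asymmetric rock-paper-scissors-style payoffs, for illustration only.
EXAMPLE_PAYOFF = [[0.0, -1.0, 2.0], [1.0, 0.0, -1.0], [-1.0, 1.0, 0.0]]
avg_row, avg_col = solve_matrix_game(EXAMPLE_PAYOFF)
print(avg_row, avg_col, exploitability(EXAMPLE_PAYOFF, avg_row, avg_col))
```

Setting `alpha`, `beta`, and `gamma` very large makes all three factors approach 1, so the sketch approaches plain regret matching with uniform averaging; that gives a simple baseline to compare exploitability against on the same matrix.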

Taxonomy

Methods: Pruning