Deep Counterfactual Regret Minimization
Noam Brown, Adam Lerer, Sam Gross, Tuomas Sandholm

TL;DR
This paper introduces Deep CFR, a neural network-based approach that eliminates the need for game abstraction in large imperfect-information games, achieving strong results in poker.
Contribution
It presents the first successful non-tabular CFR variant using deep learning to directly approximate strategies in large games.
Findings
Deep CFR outperforms traditional tabular CFR in large poker games.
The method effectively approximates CFR behavior without manual abstraction.
Deep CFR demonstrates strong convergence properties in complex game settings.
Abstract
Counterfactual Regret Minimization (CFR) is the leading framework for solving large imperfect-information games. It converges to an equilibrium by iteratively traversing the game tree. In order to deal with extremely large games, abstraction is typically applied before running CFR. The abstracted game is solved with tabular CFR, and its solution is mapped back to the full game. This process can be problematic because aspects of abstraction are often manual and domain specific, abstraction algorithms may miss important strategic nuances of the game, and there is a chicken-and-egg problem because determining a good abstraction requires knowledge of the equilibrium of the game. This paper introduces Deep Counterfactual Regret Minimization, a form of CFR that obviates the need for abstraction by instead using deep neural networks to approximate the behavior of CFR in the full game. We show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Gambling Behavior and Treatments · Reinforcement Learning in Robotics
