Real-Time Parallel Counterfactual Regret Minimization
Boning Li, Longbo Huang

TL;DR
This paper introduces Parallel CFR, a novel parallelization framework for real-time depth-limited CFR solving in large imperfect-information games, achieving significant speedups on a single desktop device.
Contribution
The paper presents the first parallelization framework for real-time CFR that integrates pruning, abstraction, and neural network inference, enabling faster decision-making in complex games.
Findings
Achieves 3.3-3.4x speedup over single-threaded CFR
Per-iteration time of approximately 47-54 ms on large game trees
Operates efficiently on a single desktop device, enabling hundreds of CFR iterations within real-time constraints
Abstract
Counterfactual Regret Minimization (CFR) is the dominant algorithmic family for solving large imperfect-information games, underpinning breakthroughs such as Libratus and Pluribus in No-Limit Texas Hold'em poker. In real-time game-playing systems, the solver must compute a near-equilibrium strategy within a strict time budget of only a few seconds per decision, and the number of CFR iterations completed in this window directly determines play strength. We present \textbf{Parallel CFR}, the first parallelization framework for real-time depth-limited CFR solving that seamlessly integrates pruning, abstraction, and advanced CFR variants. We decompose each CFR iteration into a pipeline of seven stages and identify two orthogonal dimensions of parallelism: \emph{by information set} and \emph{by tree node}. Leaf node evaluation is offloaded to GPUs via batched neural network inference,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
