Parallelizing Counterfactual Regret Minimization
Juho Kim, Tuomas Sandholm

TL;DR
This paper introduces a GPU-based parallelization framework for counterfactual regret minimization algorithms, significantly accelerating their performance in solving large imperfect-information games.
Contribution
It reframes CFR algorithms as linear algebra operations, enabling effective parallelization and broad applicability to various CFR variants.
Findings
GPU implementation of CFR is up to 10,000 times faster than CPU-based implementations.
The framework applies to multiple CFR variants, including CFR+ and discounted CFR.
Parallelization dramatically reduces training and evaluation times for large game models.
Abstract
Parallelization has played an instrumental role in the field of artificial intelligence (AI), drastically reducing the time taken to train and evaluate large AI models. In contrast to its impact in the broader field of AI, applying parallelization to computational game solving is relatively unexplored, despite its great potential. In this paper, we parallelize the family of counterfactual regret minimization (CFR) algorithms, which were central to important breakthroughs for solving large imperfect-information games. We present a generalized parallelization framework, reframing CFR as a series of linear algebra operations. Then, existing techniques for parallelizing linear algebra operations can be applied to accelerate CFR. We also describe how our technique can be applied to other tabular members of the CFR family of algorithms, including the state-of-the-art, such as CFR+, discounted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
