Deep (Predictive) Discounted Counterfactual Regret Minimization

Hang Xu; Kai Li; Haobo Fu; Qiang Fu; Junliang Xing; Jian Cheng

arXiv:2511.08174·cs.LG·November 12, 2025

Deep (Predictive) Discounted Counterfactual Regret Minimization

Hang Xu, Kai Li, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng

PDF

Open Access 1 Video

TL;DR

This paper introduces a neural CFR algorithm that efficiently approximates advanced CFR variants, leading to faster convergence and stronger performance in imperfect-information games.

Contribution

It presents a novel model-free neural CFR method that overcomes limitations of existing approaches in integrating advanced CFR variants.

Findings

01

Faster convergence in typical imperfect-information games

02

Stronger adversarial performance in large poker game

03

Effective approximation of advanced CFR variants

Abstract

Counterfactual regret minimization (CFR) is a family of algorithms for effectively solving imperfect-information games. To enhance CFR's applicability in large games, researchers use neural networks to approximate its behavior. However, existing methods are mainly based on vanilla CFR and struggle to effectively integrate more advanced CFR variants. In this work, we propose an efficient model-free neural CFR algorithm, overcoming the limitations of existing methods in approximating advanced CFR variants. At each iteration, it collects variance-reduced sampled advantages based on a value network, fits cumulative advantages by bootstrapping, and applies discounting and clipping operations to simulate the update mechanisms of advanced CFR variants. Experimental results show that, compared with model-free neural algorithms, it exhibits faster convergence in typical imperfect-information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Deep (Predictive) Discounted Counterfactual Regret Minimization· underline

Taxonomy

TopicsArtificial Intelligence in Games · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics