CFR-p: Counterfactual Regret Minimization with Hierarchical Policy Abstraction, and its Application to Two-player Mahjong

Shiheng Wang

arXiv:2307.12087·cs.AI·July 25, 2023

Open Access

TL;DR

This paper applies Counterfactual Regret Minimization (CFR) to two-player Mahjong, an imperfect-information game more complex than poker, using a hierarchical abstraction based on winning policies. The author states that the framework can be generalized to other imperfect-information games.

Contribution

The paper introduces a hierarchical policy abstraction framework for CFR, built around winning policies in two-player Mahjong and intended to make CFR applicable to more complex imperfect-information games.

Findings

01 Effective application of CFR to two-player Mahjong

02 Hierarchical abstraction improves computational efficiency

03 Framework generalizes to other imperfect information games

Abstract

Counterfactual Regret Minimization (CFR) has proven successful in Texas Hold'em poker. We apply the algorithm to another popular imperfect-information game, Mahjong. Compared with poker, Mahjong is considerably more complex and has many variants. We study two-player Mahjong through game-theoretic analysis and build a hierarchical abstraction for CFR based on winning policies. The framework can be generalized to other imperfect-information games.
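For readers unfamiliar with CFR, the sketch below illustrates regret matching, the per-decision update that standard CFR builds on. It is a generic illustration, not the paper's hierarchical abstraction or its Mahjong model. The function names `regret_matching` and `update_regret` are hypothetical, and the regret update is simplified in that it omits the counterfactual reach-probability weighting used in full CFR.

```python
from typing import List


def regret_matching(cumulative_regret: List[float]) -> List[float]:
    """Map cumulative regrets to a strategy.

    Each action gets probability proportional to its positive regret;
    if no action has positive regret, play uniformly at random.
    """
    positive = [max(r, 0.0) for r in cumulative_regret]
    total = sum(positive)
    if total > 0.0:
        return [p / total for p in positive]
    n = len(cumulative_regret)
    return [1.0 / n] * n


def update_regret(
    cumulative_regret: List[float],
    action_values: List[float],
    strategy: List[float],
) -> List[float]:
    """Add each action's instantaneous regret to the running total.

    Simplification (assumption): no reach-probability weighting, so this
    is plain regret matching at a single decision point, not full CFR.
    """
    # Expected value of the decision point under the current strategy.
    node_value = sum(p * v for p, v in zip(strategy, action_values))
    # Regret of an action = its value minus the value actually obtained.
    return [r + (v - node_value) for r, v in zip(cumulative_regret, action_values)]


# Usage: one update at a decision point with two actions.
regrets = [0.0, 0.0]
strategy = regret_matching(regrets)                      # uniform at the start
regrets = update_regret(regrets, [1.0, -1.0], strategy)  # action 0 paid more
next_strategy = regret_matching(regrets)                 # shifts toward action 0
```

Repeating this update over many iterations and averaging the strategies is what lets regret-based methods approach an equilibrium in two-player zero-sum games; an abstraction such as the one the abstract describes reduces how many decision points this has to be done for.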

Taxonomy

Topics: Gambling Behavior and Treatments · Artificial Intelligence in Games · Sports Analytics and Performance