PokerGPT: An End-to-End Lightweight Solver for Multi-Player Texas   Hold'em via Large Language Model

Chenghao Huang; Yanbo Cao; Yinlong Wen; Tao Zhou; Yanru Zhang

arXiv:2401.06781·cs.AI·January 17, 2024·5 cites

PokerGPT: An End-to-End Lightweight Solver for Multi-Player Texas Hold'em via Large Language Model

Chenghao Huang, Yanbo Cao, Yinlong Wen, Tao Zhou, Yanru Zhang

PDF

Open Access

TL;DR

PokerGPT introduces a lightweight, end-to-end LLM-based solver for multi-player Texas Hold'em, achieving high win rates with efficient training and interaction, surpassing prior methods in speed and size.

Contribution

This work presents the first lightweight LLM-based poker solver capable of multi-player Texas Hold'em, using reinforcement learning with human feedback and prompt engineering for effective decision-making.

Findings

01

PokerGPT outperforms previous approaches in win rate.

02

It requires less training time and smaller model size.

03

Provides fast and human-interactive decision advice.

Abstract

Poker, also known as Texas Hold'em, has always been a typical research target within imperfect information games (IIGs). IIGs have long served as a measure of artificial intelligence (AI) development. Representative prior works, such as DeepStack and Libratus heavily rely on counterfactual regret minimization (CFR) to tackle heads-up no-limit Poker. However, it is challenging for subsequent researchers to learn CFR from previous models and apply it to other real-world applications due to the expensive computational cost of CFR iterations. Additionally, CFR is difficult to apply to multi-player games due to the exponential growth of the game tree size. In this work, we introduce PokerGPT, an end-to-end solver for playing Texas Hold'em with arbitrary number of players and gaining high win rates, established on a lightweight large language model (LLM). PokerGPT only requires simple textual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Sports Analytics and Performance · Digital Games and Media

MethodsSparse Evolutionary Training