Learning to Play Blackjack: A Curriculum Learning Perspective

Amirreza Alasti; Efe Erdal; Y\"ucel Celik; Theresa Eimer

arXiv:2604.00076·cs.LG·April 3, 2026

Learning to Play Blackjack: A Curriculum Learning Perspective

Amirreza Alasti, Efe Erdal, Y\"ucel Celik, Theresa Eimer

PDF

TL;DR

This paper introduces a curriculum learning framework using Large Language Models to improve reinforcement learning agents in Blackjack, resulting in faster training and better performance.

Contribution

It presents a novel LLM-guided curriculum approach for RL that enhances learning efficiency and effectiveness in complex environments like Blackjack.

Findings

01

DQN agent's win rate increased from 43.97% to 47.41%.

02

Bust rate reduced from 32.9% to 28.0%.

03

Training workflow accelerated by over 74%.

Abstract

Reinforcement Learning (RL) agents often struggle with efficiency and performance in complex environments. We propose a novel framework that uses a Large Language Model (LLM) to dynamically generate a curriculum over available actions, enabling the agent to incorporate each action individually. We apply this framework to the game of Blackjack, where the LLM creates a multi-stage training path that progressively introduces complex actions to a Tabular Q-Learning and a Deep Q-Network (DQN) agent. Our evaluation in a realistic 8-deck simulation over 10 independent runs demonstrates significant performance gains over standard training methods. The curriculum-based approach increases the DQN agent's average win rate from 43.97% to 47.41%, reduces the average bust rate from 32.9% to 28.0%, and accelerates the overall workflow by over 74%, with the agent's full training completing faster than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.