Evolutionary Optimization of Deep Learning Agents for Sparrow Mahjong

Jim O'Connor; Derin Gezgin; Gary B. Parker

arXiv:2508.07522 · cs.NE · August 12, 2025

Open Access

TL;DR

This paper introduces Evo-Sparrow, a deep learning agent for Sparrow Mahjong whose LSTM networks are trained with CMA-ES. The agent outperforms random and rule-based agents and performs comparably to a PPO baseline, illustrating a hybrid of deep learning and evolutionary optimization for game AI.

Contribution

The paper presents a hybrid method that combines deep learning (LSTM networks) with evolutionary optimization (CMA-ES) for decision-making in stochastic, partially observable games.

Findings

01 · Outperforms rule-based agents in Sparrow Mahjong

02 · Achieves performance comparable to PPO baseline

03 · Demonstrates effectiveness of evolutionary strategies in complex games

Abstract

We present Evo-Sparrow, a deep learning-based agent for AI decision-making in Sparrow Mahjong, trained by optimizing Long Short-Term Memory (LSTM) networks using Covariance Matrix Adaptation Evolution Strategy (CMA-ES). Our model evaluates board states and optimizes decision policies in a non-deterministic, partially observable game environment. Empirical analysis conducted over a significant number of simulations demonstrates that our model outperforms both random and rule-based agents, and achieves performance comparable to a Proximal Policy Optimization (PPO) baseline, indicating strong strategic play and robust policy quality. By combining deep learning with evolutionary optimization, our approach provides a computationally effective alternative to traditional reinforcement learning and gradient-based optimization methods. This research contributes to the broader field of AI game…
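
To make the idea of optimizing a parameter vector with an evolution strategy instead of gradients more concrete, here is a minimal sketch in Python. It is not the authors' code and it is not full CMA-ES: it samples from an isotropic Gaussian and shrinks the step size on a fixed schedule, whereas CMA-ES adapts a full covariance matrix and step size from the search history. The function names (`evaluate`, `evolve`), the toy objective, and all hyperparameter values are assumptions made for illustration; in the setting the abstract describes, the fitness would presumably come from simulated Sparrow Mahjong games and the parameter vector from the LSTM's weights.

```python
import random


def evaluate(params):
    """Hypothetical fitness, standing in for 'average score over simulated games'.

    Here it is only the negative squared distance to a fixed target vector,
    so higher is better and the optimum sits at the target.
    """
    target = [0.5, -1.0, 2.0]
    return -sum((p - t) ** 2 for p, t in zip(params, target))


def evolve(dim=3, population=20, elite=5, sigma=0.5, decay=0.98,
           generations=200, seed=0):
    """Simplified evolution strategy (NOT full CMA-ES: no covariance adaptation)."""
    rng = random.Random(seed)
    # Assumption: a flat vector plays the role of the network's parameters.
    mean = [0.0] * dim
    for _ in range(generations):
        # Sample candidate parameter vectors around the current mean.
        candidates = [
            [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
            for _ in range(population)
        ]
        # Rank by fitness, best first; no gradients are required.
        candidates.sort(key=evaluate, reverse=True)
        best = candidates[:elite]
        # Move the mean to the average of the best candidates.
        mean = [sum(c[i] for c in best) / elite for i in range(dim)]
        # Shrink the search radius on a fixed schedule (CMA-ES adapts this instead).
        sigma *= decay
    return mean


if __name__ == "__main__":
    best = evolve()
    print("best parameters:", [round(x, 3) for x in best])
    print("fitness:", round(evaluate(best), 5))
```

The loop only ever calls `evaluate` on whole candidates, which is why this family of methods fits settings where the score comes from noisy game outcomes rather than a differentiable loss.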

Taxonomy

Topics: Reinforcement Learning in Robotics · Artificial Intelligence in Games · Sports Analytics and Performance