Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search

Berk Yilmaz; Junyu Hu; Jinsong Liu

arXiv:2506.15880·cs.AI·June 23, 2025

Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search

Berk Yilmaz, Junyu Hu, Jinsong Liu

PDF

Open Access

TL;DR

This paper develops a Deep Reinforcement Learning system for Xiangqi that combines neural networks with Monte Carlo Tree Search to improve strategic gameplay and self-learning in this complex Chinese Chess variant.

Contribution

It introduces a novel DRL-MCTS framework tailored for Xiangqi, addressing its unique rules and high complexity, advancing AI in culturally significant strategy games.

Findings

01

Achieved strategic self-play through neural network-guided MCTS

02

Improved decision-making accuracy in Xiangqi

03

Demonstrated adaptability of DRL-MCTS to complex rule systems

Abstract

This paper presents a Deep Reinforcement Learning (DRL) system for Xiangqi (Chinese Chess) that integrates neural networks with Monte Carlo Tree Search (MCTS) to enable strategic self-play and self-improvement. Addressing the underexplored complexity of Xiangqi, including its unique board layout, piece movement constraints, and victory conditions, our approach combines policy-value networks with MCTS to simulate move consequences and refine decision-making. By overcoming challenges such as Xiangqi's high branching factor and asymmetrical piece dynamics, our work advances AI capabilities in culturally significant strategy games while providing insights for adapting DRL-MCTS frameworks to domain-specific rule systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Sports Analytics and Performance