AlphaZero Gomoku

Wen Liang; Chao Yu; Brian Whiteaker; Inyoung Huh; Hua Shao; Youzhi; Liang

arXiv:2309.01294·cs.AI·September 6, 2023

AlphaZero Gomoku

Wen Liang, Chao Yu, Brian Whiteaker, Inyoung Huh, Hua Shao, Youzhi, Liang

PDF

Open Access

TL;DR

This paper extends AlphaZero's deep reinforcement learning and Monte Carlo tree search approach to the game of Gomoku, demonstrating its adaptability and effectiveness beyond traditional games like Go, chess, and shogi.

Contribution

It introduces AlphaZero to Gomoku, addressing inherent game biases and showcasing its versatility in mastering new strategic board games.

Findings

01

AlphaZero successfully learned Gomoku strategies.

02

The method achieved balanced gameplay despite initial biases.

03

Demonstrated adaptability of AlphaZero to new game environments.

Abstract

In the past few years, AlphaZero's exceptional capability in mastering intricate board games has garnered considerable interest. Initially designed for the game of Go, this revolutionary algorithm merges deep learning techniques with the Monte Carlo tree search (MCTS) to surpass earlier top-tier methods. In our study, we broaden the use of AlphaZero to Gomoku, an age-old tactical board game also referred to as "Five in a Row." Intriguingly, Gomoku has innate challenges due to a bias towards the initial player, who has a theoretical advantage. To add value, we strive for a balanced game-play. Our tests demonstrate AlphaZero's versatility in adapting to games other than Go. MCTS has become a predominant algorithm for decision processes in intricate scenarios, especially board games. MCTS creates a search tree by examining potential future actions and uses random sampling to predict…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Sports Analytics and Performance · Educational Games and Gamification

MethodsAlphaZero