AlphaZero-Inspired Game Learning: Faster Training by Using MCTS Only at   Test Time

Johannes Scheiermann; Wolfgang Konen

arXiv:2204.13307·cs.LG·September 27, 2022

AlphaZero-Inspired Game Learning: Faster Training by Using MCTS Only at Test Time

Johannes Scheiermann, Wolfgang Konen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach combining Monte Carlo Tree Search (MCTS) with temporal difference learning, applied only at test time, enabling low-resource agents to outperform strong game programs on complex games.

Contribution

It presents a new architecture that integrates MCTS with TD learning agents only during testing, reducing training computational demands while maintaining high performance.

Findings

01

Agent beats strong Othello program Edax up to level 7

02

Achieves competitive results on ConnectFour and Rubik's Cube

03

Operates effectively on standard hardware without GPUs or TPUs

Abstract

Recently, the seminal algorithms AlphaGo and AlphaZero have started a new era in game learning and deep reinforcement learning. While the achievements of AlphaGo and AlphaZero - playing Go and other complex games at super human level - are truly impressive, these architectures have the drawback that they require high computational resources. Many researchers are looking for methods that are similar to AlphaZero, but have lower computational demands and are thus more easily reproducible. In this paper, we pick an important element of AlphaZero - the Monte Carlo Tree Search (MCTS) planning stage - and combine it with temporal difference (TD) learning agents. We wrap MCTS for the first time around TD n-tuple networks and we use this wrapping only at test time to create versatile agents that keep at the same time the computational demands low. We apply this new architecture to several…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

WolfgangKonen/PapersWithCodeOthello
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Sports Analytics and Performance · Digital Games and Media

MethodsAlphaZero