Batch Monte Carlo Tree Search

Tristan Cazenave

arXiv:2104.04278·cs.AI·April 12, 2021·1 cites

Batch Monte Carlo Tree Search

Tristan Cazenave

PDF

Open Access

TL;DR

This paper introduces batched Monte Carlo Tree Search algorithms that leverage GPU efficiency for neural network inferences, combining search trees and transposition tables, and evaluates several heuristics in the game of Go.

Contribution

It proposes a novel MCTS algorithm using both search trees and transposition tables with batch inference, and analyzes multiple heuristics to enhance search performance.

Findings

01

Batched MCTS significantly speeds up inference on GPUs.

02

Combining search trees with transposition tables improves search efficiency.

03

Heuristics like the $PU$, Virtual Mean, Last Iteration, and Second Move enhance Go gameplay.

Abstract

Making inferences with a deep neural network on a batch of states is much faster with a GPU than making inferences on one state after another. We build on this property to propose Monte Carlo Tree Search algorithms using batched inferences. Instead of using either a search tree or a transposition table we propose to use both in the same algorithm. The transposition table contains the results of the inferences while the search tree contains the statistics of Monte Carlo Tree Search. We also propose to analyze multiple heuristics that improve the search: the $μ$ FPU, the Virtual Mean, the Last Iteration and the Second Move heuristics. They are evaluated for the game of Go using a MobileNet neural network.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Time Series Analysis and Forecasting · Sports Analytics and Performance