Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions

Weirui Ye; Pieter Abbeel; Yang Gao

arXiv:2210.12628·cs.AI·October 25, 2022·1 cites

Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions

Weirui Ye, Pieter Abbeel, Yang Gao

PDF

Open Access 1 Video

TL;DR

This paper introduces V-MCTS, an adaptive variant of Monte-Carlo tree search that allocates more computation to harder states, reducing overall search time while maintaining performance.

Contribution

It presents a novel adaptive search method with theoretical bounds, improving efficiency of MCTS in complex domains.

Findings

01

Achieves comparable performance to standard MCTS

02

Reduces search time by over 50% on average

03

Effective in Go and Atari game environments

Abstract

One of the most important AI research questions is to trade off computation versus performance since ``perfect rationality" exists in theory but is impossible to achieve in practice. Recently, Monte-Carlo tree search (MCTS) has attracted considerable attention due to the significant performance improvement in various challenging domains. However, the expensive time cost during search severely restricts its scope for applications. This paper proposes the Virtual MCTS (V-MCTS), a variant of MCTS that spends more search time on harder states and less search time on simpler states adaptively. We give theoretical bounds of the proposed method and evaluate the performance and computations on $9 \times 9$ Go board games and Atari games. Experiments show that our method can achieve comparable performances to the original search algorithm while requiring less than $50%$ search time on average.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions· slideslive

Taxonomy

TopicsArtificial Intelligence in Games · Sports Analytics and Performance · Video Analysis and Summarization

MethodsMonte-Carlo Tree Search