Monte Carlo Tree Search with Scalable Simulation Periods for Continuously Running Tasks
Seydou Ba, Takuya Hiraoka, Takashi Onishi, Toru Nakata, Yoshimasa, Tsuruoka

TL;DR
This paper introduces a method to adapt Monte Carlo Tree Search for continuously running tasks by dynamically adjusting simulation time, leading to improved decision-making in environments with continuous and discrete actions.
Contribution
The paper extends the HOOT method to optimize simulation time as a decision variable, enhancing MCTS performance in continuous decision environments.
Findings
Outperforms traditional MCTS in continuous decision tasks
Improves MCTS performance in most ALE tasks
Effectively balances simulation time and action selection
Abstract
Monte Carlo Tree Search (MCTS) is particularly adapted to domains where the potential actions can be represented as a tree of sequential decisions. For an effective action selection, MCTS performs many simulations to build a reliable tree representation of the decision space. As such, a bottleneck to MCTS appears when enough simulations cannot be performed between action selections. This is particularly highlighted in continuously running tasks, for which the time available to perform simulations between actions tends to be limited due to the environment's state constantly changing. In this paper, we present an approach that takes advantage of the anytime characteristic of MCTS to increase the simulation time when allowed. Our approach is to effectively balance the prospect of selecting an action with the time that can be spared to perform MCTS simulations before the next action…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety
