Accelerating Deep Neural Network guided MCTS using Adaptive Parallelism
Yuan Meng, Qian Wang, Tianxin Zu, Viktor Prasanna

TL;DR
This paper introduces an adaptive parallelism framework for DNN-guided MCTS that dynamically selects optimal parallel schemes and communication batch sizes, significantly improving performance on CPU and GPU systems.
Contribution
It develops a novel adaptive parallel scheme selection method for MCTS, optimizing performance tradeoffs and interface communication in DNN-MCTS systems.
Findings
Achieves 1.5x to 3x speedup over baseline methods.
Effectively balances parallelism tradeoffs for CPU and GPU platforms.
Demonstrates improved efficiency on Alphazero board game benchmarks.
Abstract
Deep Neural Network guided Monte-Carlo Tree Search (DNN-MCTS) is a powerful class of AI algorithms. In DNN-MCTS, a Deep Neural Network model is trained collaboratively with a dynamic Monte-Carlo search tree to guide the agent towards actions that yields the highest returns. While the DNN operations are highly parallelizable, the search tree operations involved in MCTS are sequential and often become the system bottleneck. Existing MCTS parallel schemes on shared-memory multi-core CPU platforms either exploit data parallelism but sacrifice memory access latency, or take advantage of local cache for low-latency memory accesses but constrain the tree search to a single thread. In this work, we analyze the tradeoff of these parallel schemes and develop performance models for both parallel schemes based on the application and hardware parameters. We propose a novel implementation that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)
