Multiple Tree for Partially Observable Monte-Carlo Tree Search
David Auger (LRI, INRIA Saclay - Ile de France)

TL;DR
This paper introduces a Monte-Carlo tree search algorithm leveraging bandit methods to compute approximate Nash equilibria in partially observable games, demonstrated on phantom tic-tac-toe.
Contribution
It presents a novel algorithm that efficiently computes strong strategies for partially observable games using Monte-Carlo tree search and bandit techniques.
Findings
Effective computation of approximate Nash equilibria.
Strong strategies achieved in phantom tic-tac-toe.
Algorithm shows promising results in partial observability settings.
Abstract
We propose an algorithm for computing approximate Nash equilibria of partially observable games using Monte-Carlo tree search based on recent bandit methods. We obtain experimental results for the game of phantom tic-tac-toe, showing that strong strategies can be efficiently computed by our algorithm.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Artificial Intelligence in Games · Reinforcement Learning in Robotics
