Multiple Tree for Partially Observable Monte-Carlo Tree Search

David Auger (LRI; INRIA Saclay - Ile de France)

arXiv:1102.1580·cs.GT·February 9, 2011

Multiple Tree for Partially Observable Monte-Carlo Tree Search

David Auger (LRI, INRIA Saclay - Ile de France)

PDF

Open Access

TL;DR

This paper introduces a Monte-Carlo tree search algorithm leveraging bandit methods to compute approximate Nash equilibria in partially observable games, demonstrated on phantom tic-tac-toe.

Contribution

It presents a novel algorithm that efficiently computes strong strategies for partially observable games using Monte-Carlo tree search and bandit techniques.

Findings

01

Effective computation of approximate Nash equilibria.

02

Strong strategies achieved in phantom tic-tac-toe.

03

Algorithm shows promising results in partial observability settings.

Abstract

We propose an algorithm for computing approximate Nash equilibria of partially observable games using Monte-Carlo tree search based on recent bandit methods. We obtain experimental results for the game of phantom tic-tac-toe, showing that strong strategies can be efficiently computed by our algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Artificial Intelligence in Games · Reinforcement Learning in Robotics