Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess

Gregory Clark

arXiv:2110.01810·cs.AI·November 2, 2021

Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess

Gregory Clark

PDF

Open Access 1 Repo

TL;DR

This paper presents DSMCP, a novel planning algorithm for large imperfect information games, demonstrated by its success in reconnaissance blind chess, and introduces new inference and bandit techniques.

Contribution

It introduces DSMCP, a new Monte Carlo planning method using synopses for uncertainty, and develops Penumbra, a program that won the 2020 reconnaissance blind chess competition.

Findings

01

Penumbra outperformed 33 competitors in the 2020 competition.

02

DSMCP effectively handles large imperfect information games.

03

Algorithm variants with caution, paranoia, and new bandit methods were evaluated.

Abstract

This paper introduces deep synoptic Monte Carlo planning (DSMCP) for large imperfect information games. The algorithm constructs a belief state with an unweighted particle filter and plans via playouts that start at samples drawn from the belief state. The algorithm accounts for uncertainty by performing inference on "synopses," a novel stochastic abstraction of information states. DSMCP is the basis of the program Penumbra, which won the official 2020 reconnaissance blind chess competition versus 33 other programs. This paper also evaluates algorithm variants that incorporate caution, paranoia, and a novel bandit algorithm. Furthermore, it audits the synopsis features used in Penumbra with per-bit saliency statistics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

w-hat/penumbra
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Sports Analytics and Performance · Reinforcement Learning in Robotics