Supervised and Reinforcement Learning from Observations in   Reconnaissance Blind Chess

Timo Bertram; Johannes F\"urnkranz; Martin M\"uller

arXiv:2208.02029·cs.AI·August 4, 2022

Supervised and Reinforcement Learning from Observations in Reconnaissance Blind Chess

Timo Bertram, Johannes F\"urnkranz, Martin M\"uller

PDF

Open Access

TL;DR

This paper develops a reinforcement learning approach for Reconnaissance Blind Chess using only observations, combining supervised learning and self-play, achieving competitive performance without search or full state knowledge.

Contribution

It introduces a novel RL training method for RBC based on observations, inspired by AlphaGo, and demonstrates effective self-play improvements without search.

Findings

01

Achieved ELO of 1330 on RBC leaderboard

02

Self-play significantly improves agent performance

03

Agent performs well without search or full state assumptions

Abstract

In this work, we adapt a training approach inspired by the original AlphaGo system to play the imperfect information game of Reconnaissance Blind Chess. Using only the observations instead of a full description of the game state, we first train a supervised agent on publicly available game records. Next, we increase the performance of the agent through self-play with the on-policy reinforcement learning algorithm Proximal Policy Optimization. We do not use any search to avoid problems caused by the partial observability of game states and only use the policy network to generate moves when playing. With this approach, we achieve an ELO of 1330 on the RBC leaderboard, which places our agent at position 27 at the time of this writing. We see that self-play significantly improves performance and that the agent plays acceptably well without search and without making assumptions about the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSports Analytics and Performance · Artificial Intelligence in Games · Reinforcement Learning in Robotics