Superior Computer Chess with Model Predictive Control, Reinforcement   Learning, and Rollout

Atharva Gundawar; Yuchao Li; Dimitri Bertsekas

arXiv:2409.06477·cs.AI·September 11, 2024

Superior Computer Chess with Model Predictive Control, Reinforcement Learning, and Rollout

Atharva Gundawar, Yuchao Li, Dimitri Bertsekas

PDF

Open Access

TL;DR

This paper introduces a novel architecture combining model predictive control, rollout, and reinforcement learning to enhance computer chess engines' performance, applicable across various engine strengths and improving move selection accuracy.

Contribution

The paper presents a new move selection architecture integrating existing engines with MPC/RL techniques, significantly boosting their evaluation and decision-making capabilities.

Findings

01

Architecture improves engine performance across various strengths

02

Enhanced move selection with a one-move lookahead and nominal opponent

03

Complex schemes with multistep lookahead further improve results

Abstract

In this paper we apply model predictive control (MPC), rollout, and reinforcement learning (RL) methodologies to computer chess. We introduce a new architecture for move selection, within which available chess engines are used as components. One engine is used to provide position evaluations in an approximation in value space MPC/RL scheme, while a second engine is used as nominal opponent, to emulate or approximate the moves of the true opponent player. We show that our architecture improves substantially the performance of the position evaluation engine. In other words our architecture provides an additional layer of intelligence, on top of the intelligence of the engines on which it is based. This is true for any engine, regardless of its strength: top engines such as Stockfish and Komodo Dragon (of varying strengths), as well as weaker engines. Structurally, our basic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Sports Analytics and Performance