Superior Computer Chess with Model Predictive Control, Reinforcement Learning, and Rollout
Atharva Gundawar, Yuchao Li, Dimitri Bertsekas

TL;DR
This paper introduces a novel architecture combining model predictive control, rollout, and reinforcement learning to enhance computer chess engines' performance, applicable across various engine strengths and improving move selection accuracy.
Contribution
The paper presents a new move selection architecture integrating existing engines with MPC/RL techniques, significantly boosting their evaluation and decision-making capabilities.
Findings
Architecture improves engine performance across various strengths
Enhanced move selection with a one-move lookahead and nominal opponent
Complex schemes with multistep lookahead further improve results
Abstract
In this paper we apply model predictive control (MPC), rollout, and reinforcement learning (RL) methodologies to computer chess. We introduce a new architecture for move selection, within which available chess engines are used as components. One engine is used to provide position evaluations in an approximation in value space MPC/RL scheme, while a second engine is used as nominal opponent, to emulate or approximate the moves of the true opponent player. We show that our architecture improves substantially the performance of the position evaluation engine. In other words our architecture provides an additional layer of intelligence, on top of the intelligence of the engines on which it is based. This is true for any engine, regardless of its strength: top engines such as Stockfish and Komodo Dragon (of varying strengths), as well as weaker engines. Structurally, our basic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Sports Analytics and Performance
