Aligning Superhuman AI with Human Behavior: Chess as a Model System
Reid McIlroy-Young, Siddhartha Sen, Jon Kleinberg, Ashton, Anderson

TL;DR
This paper introduces Maia, a customized AlphaZero model trained on human chess data, which better predicts human moves and mistakes, advancing the understanding of human-AI collaboration in decision-making.
Contribution
The paper develops Maia, a novel AI model trained on human chess games, demonstrating improved prediction of human moves and errors compared to existing engines.
Findings
Maia predicts human moves more accurately than traditional chess engines.
A neural network effectively predicts large human mistakes in chess.
Results show potential for AI systems designed for human collaboration.
Abstract
As artificial intelligence becomes increasingly intelligent---in some cases, achieving superhuman performance---there is growing potential for humans to learn from and collaborate with algorithms. However, the ways in which AI systems approach problems are often different from the ways people do, and thus may be uninterpretable and hard to learn from. A crucial step in bridging this gap between human and artificial intelligence is modeling the granular actions that constitute human behavior, rather than simply matching aggregate human performance. We pursue this goal in a model system with a long history in artificial intelligence: chess. The aggregate performance of a chess player unfolds as they make decisions over the course of a game. The hundreds of millions of games played online by players at every skill level form a rich source of data in which these decisions, and their exact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Reinforcement Learning in Robotics · Artificial Intelligence in Games
MethodsAlphaZero
