Modeling the Centaur: Human-Machine Synergy in Sequential Decision Making
David Shoresh, Yonatan Loewenstein

TL;DR
This paper explores human-machine synergy in chess, demonstrating how a mixture of experts approach can identify relative advantages and improve decision-making, revealing limitations of human expertise and the potential of reinforcement learning.
Contribution
It introduces a novel MoE-based framework for modeling human-machine teams in chess and analyzes the mechanisms of advantage identification and synergy.
Findings
High potential for human-machine synergy in chess.
Experts quickly saturate in advantage identification.
Reinforcement learning outperforms human experts in recognizing advantages.
Abstract
The field of collective intelligence studies how teams can achieve better results than any of the team members alone. The special case of human-machine teams carries unique challenges in this regard. For example, human teams often achieve synergy by communicating to discover their relative advantages, which is not an option if the team partner is an unexplainable deep neural network. Between 2005-2008 a set of "freestyle" chess tournaments were held, in which human-machine teams known as "centaurs", outperformed the best humans and best machines alone. Centaur players reported that they identified relative advantages between themselves and their chess program, even though the program was superhuman. Inspired by this and leveraging recent open-source models, we study human-machine like teams in chess. A human behavioral clone ("Maia") and a pure self-play RL-trained chess engine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Decision Making
