Modeling the Centaur: Human-Machine Synergy in Sequential Decision   Making

David Shoresh; Yonatan Loewenstein

arXiv:2412.18593·cs.HC·March 11, 2025

Modeling the Centaur: Human-Machine Synergy in Sequential Decision Making

David Shoresh, Yonatan Loewenstein

PDF

Open Access 1 Repo

TL;DR

This paper explores human-machine synergy in chess, demonstrating how a mixture of experts approach can identify relative advantages and improve decision-making, revealing limitations of human expertise and the potential of reinforcement learning.

Contribution

It introduces a novel MoE-based framework for modeling human-machine teams in chess and analyzes the mechanisms of advantage identification and synergy.

Findings

01

High potential for human-machine synergy in chess.

02

Experts quickly saturate in advantage identification.

03

Reinforcement learning outperforms human experts in recognizing advantages.

Abstract

The field of collective intelligence studies how teams can achieve better results than any of the team members alone. The special case of human-machine teams carries unique challenges in this regard. For example, human teams often achieve synergy by communicating to discover their relative advantages, which is not an option if the team partner is an unexplainable deep neural network. Between 2005-2008 a set of "freestyle" chess tournaments were held, in which human-machine teams known as "centaurs", outperformed the best humans and best machines alone. Centaur players reported that they identified relative advantages between themselves and their chess program, even though the program was superhuman. Inspired by this and leveraging recent open-source models, we study human-machine like teams in chess. A human behavioral clone ("Maia") and a pure self-play RL-trained chess engine…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ReserveJudgement/Centaur-GPT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making