Assessing Human Error Against a Benchmark of Perfection
Ashton Anderson, Jon Kleinberg, Sendhil Mullainathan

TL;DR
This paper investigates human decision errors using large-scale chess data, showing that inherent difficulty of a position is a stronger predictor of errors than skill or time constraints.
Contribution
It introduces a framework for predicting human errors based on features like skill, time, and difficulty, with a focus on chess as a model system.
Findings
Difficulty features outperform skill and time in predicting errors.
Large-scale analysis with millions of chess games and tablebases.
Inherent difficulty is a key factor in human decision errors.
Abstract
An increasing number of domains are providing us with detailed trace data on human decisions in settings where we can evaluate the quality of these decisions via an algorithm. Motivated by this development, an emerging line of work has begun to consider whether we can characterize and predict the kinds of decisions where people are likely to make errors. To investigate what a general framework for human error prediction might look like, we focus on a model system with a rich history in the behavioral sciences: the decisions made by chess players as they select moves in a game. We carry out our analysis at a large scale, employing datasets with several million recorded games, and using chess tablebases to acquire a form of ground truth for a subset of chess positions that have been completely solved by computers but remain challenging even for the best players in the world. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
