On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll, Rohin Shah, Mark K. Ho, Thomas L. Griffiths, Sanjit A., Seshia, Pieter Abbeel, Anca Dragan

TL;DR
This paper investigates how agents trained with self-play and population-based methods perform poorly when coordinating with humans, emphasizing the importance of learning about human behavior for effective human-AI collaboration.
Contribution
The paper introduces a simple environment based on Overcooked to evaluate human-AI coordination and demonstrates that agents trained without human models struggle to coordinate effectively with humans.
Findings
Agents trained via self-play perform poorly with humans.
Incorporating human models improves coordination with humans.
Human-aware planning enhances human-AI collaboration.
Abstract
While we would like agents that can coordinate with humans, current algorithms such as self-play and population-based training create agents that can coordinate with themselves. Agents that assume their partner to be optimal or similar to them can converge to coordination protocols that fail to understand and be understood by humans. To demonstrate this, we introduce a simple environment that requires challenging coordination, based on the popular game Overcooked, and learn a simple model that mimics human play. We evaluate the performance of agents trained via self-play and population-based training. These agents perform very well when paired with themselves, but when paired with our human model, they are significantly worse than agents designed to play with the human model. An experiment with a planning algorithm yields the same conclusion, though only when the human-aware planner is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multi-Agent Systems and Negotiation
