Human-AI Coordination via Human-Regularized Search and Learning

Hengyuan Hu; David J Wu; Adam Lerer; Jakob Foerster; Noam Brown

arXiv:2210.05125·cs.AI·October 12, 2022

Human-AI Coordination via Human-Regularized Search and Learning

Hengyuan Hu, David J Wu, Adam Lerer, Jakob Foerster, Noam Brown

PDF

Open Access

TL;DR

This paper introduces a three-step approach combining regularized search, behavioral cloning, and reinforcement learning to improve AI-human collaboration in cooperative environments, demonstrating superior performance in human experiments.

Contribution

The paper presents a novel human-regularized search and learning framework that enhances AI coordination with humans, especially in out-of-distribution scenarios.

Findings

01

Outperforms human experts in ad-hoc team settings.

02

Beats baseline methods in repeated human-AI interactions.

03

Effective in diverse, real-world human collaboration scenarios.

Abstract

We consider the problem of making AI agents that collaborate well with humans in partially observable fully cooperative environments given datasets of human behavior. Inspired by piKL, a human-data-regularized search method that improves upon a behavioral cloning policy without diverging far away from it, we develop a three-step algorithm that achieve strong performance in coordinating with real humans in the Hanabi benchmark. We first use a regularized search algorithm and behavioral cloning to produce a better human model that captures diverse skill levels. Then, we integrate the policy regularization idea into reinforcement learning to train a human-like best response to the human model. Finally, we apply regularized search on top of the best response policy at test time to handle out-of-distribution challenges when playing with humans. We evaluate our method in two large scale…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Explainable Artificial Intelligence (XAI)

MethodsTest