Human-AI Collaboration with Bandit Feedback

Ruijiang Gao; Maytal Saar-Tsechansky; Maria De-Arteaga; Ligong Han; Min Kyung Lee; Matthew Lease

arXiv:2105.10614 · cs.HC · December 14, 2021 · 1 citation

TL;DR

This paper introduces a novel approach for human-AI collaboration in decision-making using bandit feedback, leveraging human-machine complementarity to outperform individual decision-makers and optimize rewards.

Contribution

It develops a new solution for human-AI collaboration in bandit settings and extends it to multiple human decision-makers, demonstrating improved performance.

Findings

01. Methods outperform individual human and algorithm decisions

02. Personalized routing enhances team performance

03. Effective in both synthetic and real human response scenarios

Abstract

Human-machine complementarity is important when neither the algorithm nor the human yields dominant performance across all instances in a given domain. Most research on algorithmic decision-making solely centers on the algorithm's performance, while recent work that explores human-machine collaboration has framed the decision-making problems as classification tasks. In this paper, we first propose and then develop a solution for a novel human-machine collaboration problem in a bandit feedback setting. Our solution aims to exploit the human-machine complementarity to maximize decision rewards. We then extend our approach to settings with multiple human decision makers. We demonstrate the effectiveness of our proposed methods using both synthetic and real human responses, and find that our methods outperform both the algorithm and the human when they each make decisions on their own. We…
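To make the bandit feedback setting concrete, the following is a minimal sketch, not the paper's method. It assumes logged data in which only the reward of the chosen action is observed, a known logging propensity for each record, and a hypothetical router that sends each instance to either the algorithm or the human. The value of each decision-maker is estimated with a standard inverse-propensity-scored (IPS) estimator. All names (`ips_value`, `make_team`, `router`) and the toy numbers are illustrative assumptions.

```python
from typing import Callable, Sequence

Policy = Callable[[float], int]


def ips_value(
    contexts: Sequence[float],
    actions: Sequence[int],
    rewards: Sequence[float],
    propensities: Sequence[float],
    policy: Policy,
) -> float:
    """Inverse-propensity-scored estimate of a deterministic policy's mean reward.

    Only records where the policy agrees with the logged action contribute,
    each reweighted by 1 / (probability the logging policy took that action).
    """
    if len(contexts) == 0:
        raise ValueError("need at least one logged record")
    total = 0.0
    for x, a, r, p in zip(contexts, actions, rewards, propensities):
        if policy(x) == a:
            total += r / p
    return total / len(contexts)


def make_team(router: Callable[[float], bool], algorithm: Policy, human: Policy) -> Policy:
    """Combine two decision-makers: the router returns True to defer to the algorithm."""
    def team(x: float) -> int:
        return algorithm(x) if router(x) else human(x)
    return team


# Toy logged bandit feedback (made-up values): two actions chosen uniformly at random.
contexts = [0.1, 0.2, 0.8, 0.9]
actions = [0, 1, 1, 0]
rewards = [1.0, 0.0, 1.0, 0.0]
propensities = [0.5, 0.5, 0.5, 0.5]

# Hypothetical decision-makers: each is right on only part of the context space.
human: Policy = lambda x: 0       # good for small x in this toy data
algorithm: Policy = lambda x: 1   # good for large x in this toy data
router = lambda x: x > 0.5        # assumed routing rule, fixed by hand here

team = make_team(router, algorithm, human)

human_value = ips_value(contexts, actions, rewards, propensities, human)
algo_value = ips_value(contexts, actions, rewards, propensities, algorithm)
team_value = ips_value(contexts, actions, rewards, propensities, team)

print(human_value, algo_value, team_value)
```

In this toy data the routed team scores higher than either decision-maker alone, which is the kind of complementarity the abstract describes. The router here is hand-written; learning it from logged data is the part the paper addresses and is not shown.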

Code & Models

Repositories

ruijiang81/hai-blbf (Official)

Taxonomy

Topics: Mobile Crowdsensing and Crowdsourcing · Advanced Bandit Algorithms Research · Data Stream Mining Techniques