Learning Human Rewards by Inferring Their Latent Intelligence Levels in   Multi-Agent Games: A Theory-of-Mind Approach with Application to Driving Data

Ran Tian; Masayoshi Tomizuka; and Liting Sun

arXiv:2103.04289·cs.AI·March 9, 2021·1 cites

Learning Human Rewards by Inferring Their Latent Intelligence Levels in Multi-Agent Games: A Theory-of-Mind Approach with Application to Driving Data

Ran Tian, Masayoshi Tomizuka, and Liting Sun

PDF

Open Access

TL;DR

This paper introduces a novel multi-agent inverse reinforcement learning framework that models humans as bounded rational agents with latent intelligence levels, improving reward function inference in human-robot interaction and driving data analysis.

Contribution

It proposes a Theory-of-Mind inspired approach to infer humans' latent intelligence levels during reward learning in multi-agent settings, addressing limitations of previous rationality assumptions.

Findings

01

Better reward function recovery in synthetic multi-agent games.

02

Improved modeling of human driving behavior from real data.

Abstract

Reward function, as an incentive representation that recognizes humans' agency and rationalizes humans' actions, is particularly appealing for modeling human behavior in human-robot interaction. Inverse Reinforcement Learning is an effective way to retrieve reward functions from demonstrations. However, it has always been challenging when applying it to multi-agent settings since the mutual influence between agents has to be appropriately modeled. To tackle this challenge, previous work either exploits equilibrium solution concepts by assuming humans as perfectly rational optimizers with unbounded intelligence or pre-assigns humans' interaction strategies a priori. In this work, we advocate that humans are bounded rational and have different intelligence levels when reasoning about others' decision-making process, and such an inherent and latent characteristic should be accounted for in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Behavioral Health and Interventions · Experimental Behavioral Economics Studies