A Unifying Bayesian Formulation of Measures of Interpretability in   Human-AI

Sarath Sreedharan; Anagha Kulkarni; David E. Smith; Subbarao; Kambhampati

arXiv:2104.10743·cs.AI·April 23, 2021

A Unifying Bayesian Formulation of Measures of Interpretability in Human-AI

Sarath Sreedharan, Anagha Kulkarni, David E. Smith, Subbarao, Kambhampati

PDF

Open Access

TL;DR

This paper introduces a Bayesian framework that unifies various interpretability measures in human-AI interactions, modeling human beliefs and revealing the open-world nature of human-robot interactions.

Contribution

It presents a comprehensive Bayesian model that captures explicability, legibility, and predictability as special cases, unifying interpretability measures under a single framework.

Findings

01

Unified Bayesian framework for interpretability measures

02

Revealed open-world aspect of human-AI interactions

03

Model accommodates human belief updates and hypotheses

Abstract

Existing approaches for generating human-aware agent behaviors have considered different measures of interpretability in isolation. Further, these measures have been studied under differing assumptions, thus precluding the possibility of designing a single framework that captures these measures under the same assumptions. In this paper, we present a unifying Bayesian framework that models a human observer's evolving beliefs about an agent and thereby define the problem of Generalized Human-Aware Planning. We will show that the definitions of interpretability measures like explicability, legibility and predictability from the prior literature fall out as special cases of our general framework. Through this framework, we also bring a previously ignored fact to light that the human-robot interactions are in effect open-world problems, particularly as a result of modeling the human's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · AI-based Problem Solving and Planning · Reinforcement Learning in Robotics