A Bayesian Approach to Identifying Representational Errors

Ramya Ramakrishnan; Vaibhav Unhelkar; Ece Kamar; Julie Shah

arXiv:2103.15171 · cs.AI · March 30, 2021

TL;DR

This paper introduces GEM, a Bayesian generative model that identifies whether errors in AI systems or humans stem from representational limitations or other factors, aiding targeted improvements.

Contribution

The work presents a Bayesian inference method for GEM that distinguishes representational errors from non-representational ones, evaluated on both reinforcement learning agents and human users.

Findings

01 Successfully recovers blind spots in reinforcement learning agents

02 Effectively identifies representational errors in human users

03 Demonstrates utility across multiple domains

Abstract

Trained AI systems and expert decision makers can make errors that are often difficult to identify and understand. Determining the root cause for these errors can improve future decisions. This work presents Generative Error Model (GEM), a generative model for inferring representational errors based on observations of an actor's behavior (either simulated agent, robot, or human). The model considers two sources of error: those that occur due to representational limitations -- "blind spots" -- and non-representational errors, such as those caused by noise in execution or systematic errors present in the actor's policy. Disambiguating these two error types allows for targeted refinement of the actor's policy (i.e., representational errors require perceptual augmentation, while other errors can be reduced through methods such as improved training or attention support). We present a…
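As a rough illustration of the distinction the abstract draws, the toy sketch below applies Bayes' rule to ask whether the errors observed in one state are better explained by a blind spot or by execution noise. It is not the paper's GEM or its inference procedure: the function name, the two-hypothesis setup, the binomial likelihood, and every numeric value (the prior and both error rates) are assumptions chosen only for illustration.

```python
from math import comb


def blind_spot_posterior(errors, trials,
                         prior_blind=0.2,    # assumed prior P(blind spot)
                         p_err_blind=0.6,    # assumed error rate in a blind spot
                         p_err_noise=0.1):   # assumed error rate from noise alone
    """Posterior probability that a state is a blind spot (toy model).

    Hypothetical two-hypothesis model: the actor errs in each visit
    independently, with a high rate if the state is a blind spot and a
    low rate otherwise.
    """
    def binom_likelihood(k, n, p):
        # Probability of exactly k errors in n independent visits.
        return comb(n, k) * p**k * (1 - p)**(n - k)

    like_blind = binom_likelihood(errors, trials, p_err_blind)
    like_noise = binom_likelihood(errors, trials, p_err_noise)

    # Bayes' rule: weight each likelihood by its prior and normalize.
    evidence = prior_blind * like_blind + (1 - prior_blind) * like_noise
    return prior_blind * like_blind / evidence


if __name__ == "__main__":
    for k in (0, 2, 7):
        print(k, "errors in 10 visits ->", blind_spot_posterior(k, 10))
```

With these assumed values, 7 errors in 10 visits yields a posterior above 0.99 for the blind-spot hypothesis, while 0 errors in 10 visits yields a posterior below 0.01. With no observations, the posterior equals the prior.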

Taxonomy

Topics: Reinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning