Automated Rationale Generation: A Technique for Explainable AI and its Effects on Human Perceptions
Upol Ehsan, Pradyumna Tambwekar, Larry Chan, Brent Harrison, Mark, Riedl

TL;DR
This paper presents a method for generating natural language explanations for autonomous agents, demonstrating how different styles of rationales influence human perceptions and preferences in understanding agent behavior.
Contribution
It introduces a neural rationale generator trained on human explanation data and evaluates its effectiveness through user studies in the context of a Frogger-playing agent.
Findings
Generated rationales are perceived as plausible and human-like.
Participants preferred detailed rationales for better mental models.
Alignment exists between rationale features and user perceptions.
Abstract
Automated rationale generation is an approach for real-time explanation generation whereby a computational model learns to translate an autonomous agent's internal state and action data representations into natural language. Training on human explanation data can enable agents to learn to generate human-like explanations for their behavior. In this paper, using the context of an agent that plays Frogger, we describe (a) how to collect a corpus of explanations, (b) how to train a neural rationale generator to produce different styles of rationales, and (c) how people perceive these rationales. We conducted two user studies. The first study establishes the plausibility of each type of generated rationale and situates their user perceptions along the dimensions of confidence, humanlike-ness, adequate justification, and understandability. The second study further explores user preferences…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Ethics and Social Impacts of AI
