Plan Explanations as Model Reconciliation: Moving Beyond Explanation as Soliloquy
Tathagata Chakraborti, Sarath Sreedharan, Yu Zhang, Subbarao, Kambhampati

TL;DR
This paper proposes a new approach to AI plan explanations by framing them as model reconciliation problems, where the AI suggests model adjustments to align human understanding with its plans.
Contribution
It introduces the concept of explanations as model reconciliation, develops algorithms for computing such explanations, and evaluates their effectiveness in bridging model differences.
Findings
Algorithms successfully generate explanations that reconcile model differences.
Reconciliation-based explanations improve human understanding of AI plans.
The approach outperforms traditional soliloquy explanations in complex scenarios.
Abstract
When AI systems interact with humans in the loop, they are often called on to provide explanations for their plans and behavior. Past work on plan explanations primarily involved the AI system explaining the correctness of its plan and the rationale for its decision in terms of its own model. Such soliloquy is wholly inadequate in most realistic scenarios where the humans have domain and task models that differ significantly from that used by the AI system. We posit that the explanations are best studied in light of these differing models. In particular, we show how explanation can be seen as a "model reconciliation problem" (MRP), where the AI system in effect suggests changes to the human's model, so as to make its plan be optimal with respect to that changed human model. We will study the properties of such explanations, present algorithms for automatically computing them, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Bayesian Modeling and Causal Inference · Semantic Web and Ontologies
