A Model for Intelligible Interaction Between Agents That Predict and Explain
A. Baskar, Ashwin Srinivasan, Michael Bain, Enrico Coiera

TL;DR
This paper formalizes a model for human-ML system interaction focusing on prediction and explanation, defining protocols for intelligibility and analyzing conditions for effective communication.
Contribution
It introduces a formal automata-based model for intelligible interaction, bridging literature and natural language dialogue in human-ML communication.
Findings
Identifies conditions for bounded run-time sequences in interaction protocols.
Demonstrates how to map real-world interactions to the formal model.
Shows the applicability of the model to logic-based explanations and natural language dialogues.
Abstract
Machine Learning (ML) has emerged as a powerful form of data modelling with widespread applicability beyond its roots in the design of autonomous agents. However, relatively little attention has been paid to the interaction between people and ML systems. In this paper we view interaction between humans and ML systems within the broader context of communication between agents capable of prediction and explanation. We formalise the interaction model by taking agents to be automata with some special characteristics and define a protocol for communication between such agents. We define One- and Two-Way Intelligibility as properties that emerge at run-time by execution of the protocol. The formalisation allows us to identify conditions under which run-time sequences are bounded, and identify conditions under which the protocol can correctly implement an axiomatic specification of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Data Visualization and Analytics · Neural Networks and Applications
