Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey
Richard Dazeley, Peter Vamplew, Francisco Cruz

TL;DR
This paper introduces a conceptual framework called CXF that unifies explainable reinforcement learning (XRL) research and aims to develop Broad-XAI by integrating RL with external knowledge and communication tools.
Contribution
It proposes the Causal XRL Framework (CXF), unifying current XRL research and enabling RL to support Broad-XAI through integration with ontologies and explanation capabilities.
Findings
Proposes the CXF framework for unified XRL research
Highlights RL's potential for Broad-XAI development
Suggests integration of external knowledge for explanations
Abstract
Broad Explainable Artificial Intelligence moves away from interpreting individual decisions based on a single datum and aims to provide integrated explanations from multiple machine learning algorithms into a coherent explanation of an agent's behaviour that is aligned to the communication needs of the explainee. Reinforcement Learning (RL) methods, we propose, provide a potential backbone for the cognitive model required for the development of Broad-XAI. RL represents a suite of approaches that have had increasing success in solving a range of sequential decision-making problems. However, these algorithms all operate as black-box problem solvers, where they obfuscate their decision-making policy through a complex array of values and functions. EXplainable RL (XRL) is relatively recent field of research that aims to develop techniques to extract concepts from the agent's: perception of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
