Explanation Beyond Intuition: A Testable Criterion for Inherent Explainability
Michael Merry, Pat Riddle, Jim Warren

TL;DR
This paper introduces a formal, testable criterion for inherent explainability in AI models using graph theory, enabling verification of explanations and aiding regulatory compliance.
Contribution
It proposes a novel, verifiable criterion for inherent explainability based on graph theory, bridging intuition and formal testing in XAI.
Findings
The criterion matches existing intuitions on explainability.
It explains why some models are inherently explainable or not.
Demonstrates PREDICT model's inherent explainability.
Abstract
Inherent explainability is the gold standard in Explainable Artificial Intelligence (XAI). However, there is not a consistent definition or test to demonstrate inherent explainability. Work to date either characterises explainability through metrics, or appeals to intuition - "we know it when we see it". We propose a globally applicable criterion for inherent explainability. The criterion uses graph theory for representing and decomposing models for structure-local explanation, and recomposing them into global explanations. We form the structure-local explanations as annotations, a verifiable hypothesis-evidence structure that allows for a range of explanatory methods to be used. This criterion matches existing intuitions on inherent explainability, and provides justifications why a large regression model may not be explainable but a sparse neural network could be. We differentiate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Machine Learning in Healthcare
