Towards Compositional Interpretability for XAI
Sean Tull, Robin Lorenz, Stephen Clark, Ilyas Khan, Bob Coecke

TL;DR
This paper introduces a category theory-based framework for AI interpretability, proposing compositional models that unify various AI paradigms and enhance explainability through diagrammatic analysis.
Contribution
It presents a novel, formal approach to interpretability using compositional models and diagrammatic reasoning, applicable across multiple AI model types.
Findings
Diagrammatic explanations improve transparency of models.
Compositional structure enables inference and analysis.
Framework unifies deterministic, probabilistic, and quantum models.
Abstract
Artificial intelligence (AI) is currently based largely on black-box machine learning models which lack interpretability. The field of eXplainable AI (XAI) strives to address this major concern, being critical in high-stakes areas such as the finance, legal and health sectors. We present an approach to defining AI models and their interpretability based on category theory. For this we employ the notion of a compositional model, which sees a model in terms of formal string diagrams which capture its abstract structure together with its concrete implementation. This comprehensive view incorporates deterministic, probabilistic and quantum models. We compare a wide range of AI models as compositional models, including linear and rule-based models, (recurrent) neural networks, transformers, VAEs, and causal and DisCoCirc models. Next we give a definition of interpretation of a model in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies
