Towards Compositional Interpretability for XAI

Sean Tull; Robin Lorenz; Stephen Clark; Ilyas Khan; Bob Coecke

arXiv:2406.17583·cs.AI·June 26, 2024·2 cites

Towards Compositional Interpretability for XAI

Sean Tull, Robin Lorenz, Stephen Clark, Ilyas Khan, Bob Coecke

PDF

Open Access

TL;DR

This paper introduces a category theory-based framework for AI interpretability, proposing compositional models that unify various AI paradigms and enhance explainability through diagrammatic analysis.

Contribution

It presents a novel, formal approach to interpretability using compositional models and diagrammatic reasoning, applicable across multiple AI model types.

Findings

01

Diagrammatic explanations improve transparency of models.

02

Compositional structure enables inference and analysis.

03

Framework unifies deterministic, probabilistic, and quantum models.

Abstract

Artificial intelligence (AI) is currently based largely on black-box machine learning models which lack interpretability. The field of eXplainable AI (XAI) strives to address this major concern, being critical in high-stakes areas such as the finance, legal and health sectors. We present an approach to defining AI models and their interpretability based on category theory. For this we employ the notion of a compositional model, which sees a model in terms of formal string diagrams which capture its abstract structure together with its concrete implementation. This comprehensive view incorporates deterministic, probabilistic and quantum models. We compare a wide range of AI models as compositional models, including linear and rule-based models, (recurrent) neural networks, transformers, VAEs, and causal and DisCoCirc models. Next we give a definition of interpretation of a model in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies