Formal Abductive Latent Explanations for Prototype-Based Networks
Jules Soria, Zakaria Chihani, Julien Girard-Satabin, Alban Grastien, Romain Xu-Darme, Daniela Cancila

TL;DR
This paper introduces Abductive Latent Explanations (ALEs), a formal method to generate more reliable and formalized explanations for prototype-based neural networks, improving interpretability especially in safety-critical applications.
Contribution
It proposes ALEs, a novel formalism for explanations in prototype-based networks, combining interpretability with formal guarantees and offering scalable algorithms for diverse datasets.
Findings
ALEs improve explanation reliability in prototype-based models
The proposed algorithms are scalable and effective on image classification tasks
ALEs provide formal guarantees on the sufficiency of explanations
Abstract
Case-based reasoning networks are machine-learning models that make predictions based on similarity between the input and prototypical parts of training samples, called prototypes. Such models are able to explain each decision by pointing to the prototypes that contributed the most to the final outcome. As the explanation is a core part of the prediction, they are often qualified as ``interpretable by design". While promising, we show that such explanations are sometimes misleading, which hampers their usefulness in safety-critical contexts. In particular, several instances may lead to different predictions and yet have the same explanation. Drawing inspiration from the field of formal eXplainable AI (FXAI), we propose Abductive Latent Explanations (ALEs), a formalism to express sufficient conditions on the intermediate (latent) representation of the instance that imply the prediction.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Bayesian Modeling and Causal Inference
