Formal Abductive Latent Explanations for Prototype-Based Networks

Jules Soria; Zakaria Chihani; Julien Girard-Satabin; Alban Grastien; Romain Xu-Darme; Daniela Cancila

arXiv:2511.16588·cs.AI·November 21, 2025

Formal Abductive Latent Explanations for Prototype-Based Networks

Jules Soria, Zakaria Chihani, Julien Girard-Satabin, Alban Grastien, Romain Xu-Darme, Daniela Cancila

PDF

Open Access 2 Videos

TL;DR

This paper introduces Abductive Latent Explanations (ALEs), a formal method to generate more reliable and formalized explanations for prototype-based neural networks, improving interpretability especially in safety-critical applications.

Contribution

It proposes ALEs, a novel formalism for explanations in prototype-based networks, combining interpretability with formal guarantees and offering scalable algorithms for diverse datasets.

Findings

01

ALEs improve explanation reliability in prototype-based models

02

The proposed algorithms are scalable and effective on image classification tasks

03

ALEs provide formal guarantees on the sufficiency of explanations

Abstract

Case-based reasoning networks are machine-learning models that make predictions based on similarity between the input and prototypical parts of training samples, called prototypes. Such models are able to explain each decision by pointing to the prototypes that contributed the most to the final outcome. As the explanation is a core part of the prediction, they are often qualified as ``interpretable by design". While promising, we show that such explanations are sometimes misleading, which hampers their usefulness in safety-critical contexts. In particular, several instances may lead to different predictions and yet have the same explanation. Drawing inspiration from the field of formal eXplainable AI (FXAI), we propose Abductive Latent Explanations (ALEs), a formalism to express sufficient conditions on the intermediate (latent) representation of the instance that imply the prediction.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Formal Abductive Latent Explanations for Prototype-Based Networks· underline

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Bayesian Modeling and Causal Inference