Axiomatic Characterisations of Sample-based Explainers
Leila Amgoud, Martin C. Cooper, Salim Debbaoui

TL;DR
This paper provides an axiomatic framework for understanding sample-based explainers of black-box classifiers, characterizing their properties, relationships, and identifying families that satisfy key desirable criteria.
Contribution
It introduces the first broad family of explainers that guarantee explanation existence and global consistency, with polynomial-time explanation finding methods.
Findings
Characterized all explainers satisfying compatible properties
Identified a family of explainers with guaranteed existence and consistency
Provided polynomial-time algorithms for explanation retrieval
Abstract
Explaining decisions of black-box classifiers is both important and computationally challenging. In this paper, we scrutinize explainers that generate feature-based explanations from samples or datasets. We start by presenting a set of desirable properties that explainers would ideally satisfy, delve into their relationships, and highlight incompatibilities of some of them. We identify the entire family of explainers that satisfy two key properties which are compatible with all the others. Its instances provide sufficient reasons, called weak abductive explanations.We then unravel its various subfamilies that satisfy subsets of compatible properties. Indeed, we fully characterize all the explainers that satisfy any subset of compatible properties. In particular, we introduce the first (broad family of) explainers that guarantee the existence of explanations and their global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
