ASTERYX : A model-Agnostic SaT-basEd appRoach for sYmbolic and   score-based eXplanations

Ryma Boumazouza (CRIL); Fahima Cheikh-Alili (CRIL); Bertrand Mazure; (CRIL); Karim Tabia (CRIL)

arXiv:2206.11900·cs.AI·June 27, 2022

ASTERYX : A model-Agnostic SaT-basEd appRoach for sYmbolic and score-based eXplanations

Ryma Boumazouza (CRIL), Fahima Cheikh-Alili (CRIL), Bertrand Mazure, (CRIL), Karim Tabia (CRIL)

PDF

TL;DR

ASTERYX is a versatile, model-agnostic framework that generates both symbolic and score-based explanations for complex machine learning models, enhancing interpretability through symbolic encoding and relevance scoring.

Contribution

The paper introduces ASTERYX, a novel declarative approach that encodes models into symbolic representations to produce diverse explanations and relevance scores, regardless of the model type.

Findings

01

Feasibility demonstrated through experiments

02

Effective generation of symbolic explanations like sufficient reasons and counterfactuals

03

Scores reflect explanation relevance and feature importance

Abstract

The ever increasing complexity of machine learning techniques used more and more in practice, gives rise to the need to explain the predictions and decisions of these models, often used as black-boxes. Explainable AI approaches are either numerical feature-based aiming to quantify the contribution of each feature in a prediction or symbolic providing certain forms of symbolic explanations such as counterfactuals. This paper proposes a generic agnostic approach named ASTERYX allowing to generate both symbolic explanations and score-based ones. Our approach is declarative and it is based on the encoding of the model to be explained in an equivalent symbolic representation, this latter serves to generate in particular two types of symbolic explanations which are sufficient reasons and counterfactuals. We then associate scores reflecting the relevance of the explanations and the features…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.