ASTERYX : A model-Agnostic SaT-basEd appRoach for sYmbolic and score-based eXplanations
Ryma Boumazouza (CRIL), Fahima Cheikh-Alili (CRIL), Bertrand Mazure, (CRIL), Karim Tabia (CRIL)

TL;DR
ASTERYX is a versatile, model-agnostic framework that generates both symbolic and score-based explanations for complex machine learning models, enhancing interpretability through symbolic encoding and relevance scoring.
Contribution
The paper introduces ASTERYX, a novel declarative approach that encodes models into symbolic representations to produce diverse explanations and relevance scores, regardless of the model type.
Findings
Feasibility demonstrated through experiments
Effective generation of symbolic explanations like sufficient reasons and counterfactuals
Scores reflect explanation relevance and feature importance
Abstract
The ever increasing complexity of machine learning techniques used more and more in practice, gives rise to the need to explain the predictions and decisions of these models, often used as black-boxes. Explainable AI approaches are either numerical feature-based aiming to quantify the contribution of each feature in a prediction or symbolic providing certain forms of symbolic explanations such as counterfactuals. This paper proposes a generic agnostic approach named ASTERYX allowing to generate both symbolic explanations and score-based ones. Our approach is declarative and it is based on the encoding of the model to be explained in an equivalent symbolic representation, this latter serves to generate in particular two types of symbolic explanations which are sufficient reasons and counterfactuals. We then associate scores reflecting the relevance of the explanations and the features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
