Altruist: Argumentative Explanations through Local Interpretations of Predictive Models
Ioannis Mollas, Nick Bassiliades, Grigorios Tsoumakas

TL;DR
Altruist introduces a novel meta-explanation method combining logic-based argumentation with interpretable machine learning to identify truthful feature importance explanations, enhancing interpretability and evaluation of explanation techniques.
Contribution
It presents a new meta-explanation approach that evaluates and combines multiple feature importance techniques for more truthful explanations in AI models.
Findings
Ensemble of interpretation techniques improves explanation truthfulness.
Meta-explanation method can evaluate and select suitable explanation techniques.
Approach enhances comprehensibility of AI model rationales.
Abstract
Explainable AI is an emerging field providing solutions for acquiring insights into automated systems' rationale. It has been put on the AI map by suggesting ways to tackle key ethical and societal issues. Existing explanation techniques are often not comprehensible to the end user. Lack of evaluation and selection criteria also makes it difficult for the end user to choose the most suitable technique. In this study, we combine logic-based argumentation with Interpretable Machine Learning, introducing a preliminary meta-explanation methodology that identifies the truthful parts of feature importance oriented interpretations. This approach, in addition to being used as a meta-explanation technique, can be used as an evaluation or selection tool for multiple feature importance techniques. Experimentation strongly indicates that an ensemble of multiple interpretation techniques yields…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
