How Reliable and Stable are Explanations of XAI Methods?
Jos\'e Ribeiro, Lucas Cardoso, Vitor Santos, Eduardo Carvalho, and N\'ikolas Carneiro, Ronnie Alves

TL;DR
This study evaluates the reliability and stability of various XAI methods by analyzing their explanations under data perturbations using multiple models and a diabetes dataset.
Contribution
It introduces a pipeline for assessing XAI explanation stability and reliability, highlighting the robustness of the eXirt method over others.
Findings
eXirt identified the most reliable models.
Most XAI methods are sensitive to data perturbations.
One method remained stable despite perturbations.
Abstract
Black box models are increasingly being used in the daily lives of human beings living in society. Along with this increase, there has been the emergence of Explainable Artificial Intelligence (XAI) methods aimed at generating additional explanations regarding how the model makes certain predictions. In this sense, methods such as Dalex, Eli5, eXirt, Lofo and Shap emerged as different proposals and methodologies for generating explanations of black box models in an agnostic way. Along with the emergence of these methods, questions arise such as "How Reliable and Stable are XAI Methods?". With the aim of shedding light on this main question, this research creates a pipeline that performs experiments using the diabetes dataset and four different machine learning models (LGBM, MLP, DT and KNN), creating different levels of perturbations of the test data and finally generates explanations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsShapley Additive Explanations
