A Survey on the Robustness of Feature Importance and Counterfactual Explanations
Saumitra Mishra, Sanghamitra Dutta, Jason Long, Daniele Magazzeni

TL;DR
This survey reviews the robustness of local explanation methods, specifically feature importance and counterfactual explanations, in AI/ML models, emphasizing their reliability and consistency in financial applications.
Contribution
It unifies definitions of robustness, proposes a taxonomy for robustness approaches, and discusses extensions for more reliable explainability methods.
Findings
Robustness of explanations varies across methods.
Existing approaches lack comprehensive robustness evaluation.
Guidelines for improving explanation reliability are discussed.
Abstract
There exist several methods that aim to address the crucial task of understanding the behaviour of AI/ML models. Arguably, the most popular among them are local explanations that focus on investigating model behaviour for individual instances. Several methods have been proposed for local analysis, but relatively lesser effort has gone into understanding if the explanations are robust and accurately reflect the behaviour of underlying models. In this work, we present a survey of the works that analysed the robustness of two classes of local explanations (feature importance and counterfactual explanations) that are popularly used in analysing AI/ML models in finance. The survey aims to unify existing definitions of robustness, introduces a taxonomy to classify different robustness approaches, and discusses some interesting results. Finally, the survey introduces some pointers about…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Imbalanced Data Classification Techniques
