A Survey on the Robustness of Feature Importance and Counterfactual Explanations

Saumitra Mishra; Sanghamitra Dutta; Jason Long; Daniele Magazzeni

arXiv:2111.00358 · cs.LG · January 4, 2023 · 6 cites

Open Access

TL;DR

This survey reviews work on the robustness of two classes of local explanation methods, feature importance and counterfactual explanations, which are popular for understanding AI/ML models in finance.

Contribution

It unifies definitions of robustness, proposes a taxonomy for robustness approaches, and discusses extensions for more reliable explainability methods.

Findings

01. Robustness of explanations varies across methods.

02. Existing approaches lack comprehensive robustness evaluation.

03. Guidelines for improving explanation reliability are discussed.

Abstract

Several methods aim to address the crucial task of understanding the behaviour of AI/ML models. Arguably, the most popular among them are local explanations, which investigate model behaviour for individual instances. Many methods have been proposed for local analysis, but relatively little effort has gone into understanding whether the explanations are robust and accurately reflect the behaviour of the underlying models. In this work, we present a survey of the works that analysed the robustness of two classes of local explanations (feature importance and counterfactual explanations) that are popularly used in analysing AI/ML models in finance. The survey aims to unify existing definitions of robustness, introduces a taxonomy to classify different robustness approaches, and discusses some interesting results. Finally, the survey introduces some pointers about…

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Imbalanced Data Classification Techniques