Counterfactual Explanations and Algorithmic Recourses for Machine   Learning: A Review

Sahil Verma; Varich Boonsanong; Minh Hoang; Keegan E. Hines; John P.; Dickerson; Chirag Shah

arXiv:2010.10596·cs.LG·November 17, 2022·58 cites

Counterfactual Explanations and Algorithmic Recourses for Machine Learning: A Review

Sahil Verma, Varich Boonsanong, Minh Hoang, Keegan E. Hines, John P., Dickerson, Chirag Shah

PDF

Open Access

TL;DR

This review paper categorizes and evaluates various counterfactual explanation methods in machine learning, emphasizing their importance for trustworthy AI in high-impact sectors like finance and healthcare.

Contribution

It introduces a comprehensive rubric for assessing counterfactual explanation algorithms and provides a systematic comparison of existing approaches.

Findings

01

Most algorithms aim for interpretability and fidelity.

02

Counterfactual explanations align with legal standards in many countries.

03

Research gaps include scalability and robustness of explanations.

Abstract

Machine learning plays a role in many deployed decision systems, often in ways that are difficult or impossible to understand by human stakeholders. Explaining, in a human-understandable way, the relationship between the input and output of machine learning models is essential to the development of trustworthy machine learning based systems. A burgeoning body of research seeks to define the goals and methods of explainability in machine learning. In this paper, we seek to review and categorize research on counterfactual explanations, a specific class of explanation that provides a link between what could have happened had input to a model been changed in a particular way. Modern approaches to counterfactual explainability in machine learning draw connections to the established legal doctrine in many countries, making them appealing to fielded systems in high-impact areas such as finance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education