Can Explainable AI Explain Unfairness? A Framework for Evaluating Explainable AI

Kiana Alikhademi, Brianna Richardson, Emma Drobina, and Juan E. Gilbert

arXiv:2106.07483 · cs.CY · June 17, 2021 · 23 cites

Open Access

TL;DR

This paper proposes a framework for evaluating how well explainable AI (XAI) tools detect and communicate bias and fairness issues, highlighting current limitations and guiding improvements that reduce the risk of fairwashing.

Contribution

It introduces a novel framework for assessing how effectively XAI tools detect fairness issues and communicate them to users, a capability that many current explainability tools lack.

Findings

01

Many XAI tools lack bias detection features

02

Framework helps identify modifications to improve fairness communication

03

Guides developers to reduce fairwashing risks

Abstract

Many ML models are opaque to humans, producing decisions too complex for humans to easily understand. In response, explainable artificial intelligence (XAI) tools that analyze the inner workings of a model have been created. Despite these tools' strength in translating model behavior, critiques have raised concerns that XAI tools can be used for "fairwashing", misleading users into trusting biased or incorrect models. In this paper, we created a framework for evaluating explainable AI tools with respect to their capabilities for detecting and addressing issues of bias and fairness, as well as their capacity to communicate these results to their users clearly. We found that despite their capabilities in simplifying and explaining model behavior, many prominent XAI tools lack features that could be critical in detecting bias. Developers can use our framework to suggest…

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI