Measuring Unfairness through Game-Theoretic Interpretability

Juliana Cesaro; Fabio G. Cozman

arXiv:1910.05591·cs.LG·October 15, 2019

Measuring Unfairness through Game-Theoretic Interpretability

Juliana Cesaro, Fabio G. Cozman

PDF

TL;DR

This paper explores the relationship between fairness measures and feature importance, specifically evaluating and comparing them using SHAP across datasets prone to unfairness.

Contribution

It introduces methods to evaluate and compare fairness and feature importance measures, focusing on SHAP, filling a gap in existing research.

Findings

01

SHAP effectively highlights unfairness in datasets.

02

Comparison methods reveal differences between fairness and feature importance measures.

03

Results suggest potential for better interpretability of unfairness in classifiers.

Abstract

One often finds in the literature connections between measures of fairness and measures of feature importance employed to interpret trained classifiers. However, there seems to be no study that compares fairness measures and feature importance measures. In this paper we propose ways to evaluate and compare such measures. We focus in particular on SHAP, a game-theoretic measure of feature importance; we present results for a number of unfairness-prone datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsShapley Additive Explanations