# FlipTest: Fairness Testing via Optimal Transport

**Authors:** Emily Black, Samuel Yeom, Matt Fredrikson

arXiv: 1906.09218 · 2019-12-10

## TL;DR

FlipTest is a black-box method that uses optimal transport to identify discrimination in classifiers by matching individuals across protected groups and analyzing changes in model outputs.

## Contribution

It introduces a novel optimal transport-based approach to detect and explain model discrimination without causal assumptions.

## Key findings

- Effectively identifies potentially discriminated subgroups.
- Works even when models satisfy group fairness.
- Provides interpretable feature importance for discrimination.

## Abstract

We present FlipTest, a black-box technique for uncovering discrimination in classifiers. FlipTest is motivated by the intuitive question: had an individual been of a different protected status, would the model have treated them differently? Rather than relying on causal information to answer this question, FlipTest leverages optimal transport to match individuals in different protected groups, creating similar pairs of in-distribution samples. We show how to use these instances to detect discrimination by constructing a "flipset": the set of individuals whose classifier output changes post-translation, which corresponds to the set of people who may be harmed because of their group membership. To shed light on why the model treats a given subgroup differently, FlipTest produces a "transparency report": a ranking of features that are most associated with the model's behavior on the flipset. Evaluating the approach on three case studies, we show that this provides a computationally inexpensive way to identify subgroups that may be harmed by model discrimination, including in cases where the model satisfies group fairness criteria.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.09218/full.md

## Figures

36 figures with captions in the complete paper: https://tomesphere.com/paper/1906.09218/full.md

## References

49 references — full list in the complete paper: https://tomesphere.com/paper/1906.09218/full.md

---
Source: https://tomesphere.com/paper/1906.09218