Null-sampling for Interpretable and Fair Representations

Thomas Kehrenberg; Myles Bartlett; Oliver Thomas; Novi Quadrianto

arXiv:2008.05248·cs.LG·August 13, 2020

TL;DR

This paper introduces a method for learning invariant representations in the data domain, where human auditors can inspect them, to improve the fairness and interpretability of machine learning models trained on strongly biased data.

Contribution

The paper proposes an adversarially trained model with a null-sampling procedure that produces invariant representations in the data domain, using a partially-labelled representative set to enable disentanglement when the training set is strongly biased.

Findings

1. Effective on image and tabular datasets

2. Produces data-domain representations that humans can examine

3. Improves fairness under biased training conditions

Abstract

We propose to learn invariant representations, in the data domain, to achieve interpretability in algorithmic fairness. Invariance implies a selectivity for high level, relevant correlations w.r.t. class label annotations, and a robustness to irrelevant correlations with protected characteristics such as race or gender. We introduce a non-trivial setup in which the training set exhibits a strong bias such that class label annotations are irrelevant and spurious correlations cannot be distinguished. To address this problem, we introduce an adversarially trained model with a null-sampling procedure to produce invariant representations in the data domain. To enable disentanglement, a partially-labelled representative set is used. By placing the representations into the data domain, the changes made by the model are easily examinable by human auditors. We show the effectiveness of our…
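As a rough illustration of the null-sampling idea, the sketch below uses a hand-picked invertible 2x2 linear map in place of a trained model. Everything in it is an assumption made for illustration: the names `z_u`, `z_b`, `MIX`, and `make_point` are hypothetical, the data-generating process is invented, and no adversarial training takes place. It only shows the mechanics of encoding a point, zeroing the latent partition tied to the protected attribute, and decoding back into the data domain.

```python
import math

# Toy illustration only: a fixed invertible linear map stands in for a
# trained encoder. This is NOT the paper's model or training procedure.

def apply_2x2(m, v):
    """Multiply a 2x2 matrix (tuple of rows) by a 2-vector."""
    (a, b), (c, d) = m
    return (a * v[0] + b * v[1], c * v[0] + d * v[1])

def invert_2x2(m):
    """Closed-form inverse of a 2x2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is not invertible")
    return ((d / det, -b / det), (-c / det, a / det))

# Hypothetical generative process: x = u * (1.0, 0.5) + s * (0.2, 1.0),
# where u is the task-relevant factor and s the protected attribute.
MIX = ((1.0, 0.2), (0.5, 1.0))  # columns are the directions for u and s

def make_point(u, s):
    """Generate a data point from the assumed factors."""
    return apply_2x2(MIX, (u, s))

def null_sample(x, mix=MIX):
    """Encode x, zero the protected partition z_b, decode to data space."""
    z_u, _z_b = apply_2x2(invert_2x2(mix), x)  # encoder = inverse map
    return apply_2x2(mix, (z_u, 0.0))          # decoder with z_b nulled

# Two points that share u but differ in the protected attribute s.
x_s0 = make_point(u=2.0, s=0.0)
x_s1 = make_point(u=2.0, s=1.0)

x_u_s0 = null_sample(x_s0)
x_u_s1 = null_sample(x_s1)

# After null-sampling, both land on (approximately) the same point.
same = all(math.isclose(p, q, abs_tol=1e-9) for p, q in zip(x_u_s0, x_u_s1))
print("inputs:      ", x_s0, x_s1)
print("null-sampled:", x_u_s0, x_u_s1, "match:", same)
```

Because the output of `null_sample` lives in the same space as the input, the change made to each point can be inspected directly, which is the property the abstract highlights for human auditors. In a learned model the split between the two latent partitions would have to be enforced by training rather than chosen by hand as it is here.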

Code & Models

Repositories

predictive-analytics-lab/nifr (PyTorch, official)

Taxonomy

Methods: Interpretability