Representation Learning with Statistical Independence to Mitigate Bias

Ehsan Adeli; Qingyu Zhao; Adolf Pfefferbaum; Edith V. Sullivan; Li; Fei-Fei; Juan Carlos Niebles; Kilian M. Pohl

arXiv:1910.03676·cs.CV·November 23, 2020

Representation Learning with Statistical Independence to Mitigate Bias

Ehsan Adeli, Qingyu Zhao, Adolf Pfefferbaum, Edith V. Sullivan, Li, Fei-Fei, Juan Carlos Niebles, Kilian M. Pohl

PDF

2 Repos

TL;DR

This paper introduces a novel adversarial training method for learning unbiased representations in machine learning models, effectively reducing bias related to protected variables while maintaining high predictive performance across various datasets.

Contribution

It proposes a new adversarial loss function that minimizes statistical dependence between learned features and bias variables, improving fairness without sacrificing accuracy.

Findings

01

Effective bias mitigation demonstrated on synthetic and real datasets.

02

Achieves superior prediction performance with reduced bias.

03

Applicable to medical images and gender classification tasks.

Abstract

Presence of bias (in datasets or tasks) is inarguably one of the most critical challenges in machine learning applications that has alluded to pivotal debates in recent years. Such challenges range from spurious associations between variables in medical studies to the bias of race in gender or face recognition systems. Controlling for all types of biases in the dataset curation stage is cumbersome and sometimes impossible. The alternative is to use the available data and build models incorporating fair representation learning. In this paper, we propose such a model based on adversarial training with two competing objectives to learn features that have (1) maximum discriminative power with respect to the task and (2) minimal statistical mean dependence with the protected (bias) variable(s). Our approach does so by incorporating a new adversarial loss function that encourages a vanished…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.