Measuring and Reducing Gendered Correlations in Pre-trained Models

Kellie Webster; Xuezhi Wang; Ian Tenney; Alex Beutel and; Emily Pitler; Ellie Pavlick; Jilin Chen; Ed Chi; Slav Petrov

arXiv:2010.06032·cs.CL·March 4, 2021·106 cites

Measuring and Reducing Gendered Correlations in Pre-trained Models

Kellie Webster, Xuezhi Wang, Ian Tenney, Alex Beutel and, Emily Pitler, Ellie Pavlick, Jilin Chen, Ed Chi, Slav Petrov

PDF

Open Access 5 Models

TL;DR

This paper investigates gendered correlations in pre-trained models, proposing metrics to measure them, demonstrating their variability, and evaluating mitigation strategies to reduce such biases while discussing associated trade-offs.

Contribution

It introduces metrics for measuring gendered correlations, analyzes their variability across models, and evaluates mitigation techniques to reduce bias in pre-trained models.

Findings

01

Models with similar accuracy can encode different levels of gender bias.

02

General-purpose mitigation techniques can reduce gendered correlations.

03

Trade-offs exist between bias reduction and model performance.

Abstract

Pre-trained models have revolutionized natural language understanding. However, researchers have found they can encode artifacts undesired in many applications, such as professions correlating with one gender more than another. We explore such gendered correlations as a case study for how to address unintended correlations in pre-trained models. We define metrics and reveal that it is possible for models with similar accuracy to encode correlations at very different rates. We show how measured correlations can be reduced with general-purpose techniques, and highlight the trade offs different strategies have. With these results, we make recommendations for training robust models: (1) carefully evaluate unintended correlations, (2) be mindful of seemingly innocuous configuration differences, and (3) focus on general mitigations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Artificial Intelligence in Law