Measuring and Reducing Gendered Correlations in Pre-trained Models
Kellie Webster, Xuezhi Wang, Ian Tenney, Alex Beutel and, Emily Pitler, Ellie Pavlick, Jilin Chen, Ed Chi, Slav Petrov

TL;DR
This paper investigates gendered correlations in pre-trained models, proposing metrics to measure them, demonstrating their variability, and evaluating mitigation strategies to reduce such biases while discussing associated trade-offs.
Contribution
It introduces metrics for measuring gendered correlations, analyzes their variability across models, and evaluates mitigation techniques to reduce bias in pre-trained models.
Findings
Models with similar accuracy can encode different levels of gender bias.
General-purpose mitigation techniques can reduce gendered correlations.
Trade-offs exist between bias reduction and model performance.
Abstract
Pre-trained models have revolutionized natural language understanding. However, researchers have found they can encode artifacts undesired in many applications, such as professions correlating with one gender more than another. We explore such gendered correlations as a case study for how to address unintended correlations in pre-trained models. We define metrics and reveal that it is possible for models with similar accuracy to encode correlations at very different rates. We show how measured correlations can be reduced with general-purpose techniques, and highlight the trade offs different strategies have. With these results, we make recommendations for training robust models: (1) carefully evaluate unintended correlations, (2) be mindful of seemingly innocuous configuration differences, and (3) focus on general mitigations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Artificial Intelligence in Law
