Gender Bias in Contextualized Word Embeddings

Jieyu Zhao; Tianlu Wang; Mark Yatskar; Ryan Cotterell; Vicente; Ordonez; Kai-Wei Chang

arXiv:1904.03310·cs.CL·April 9, 2019·37 cites

Gender Bias in Contextualized Word Embeddings

Jieyu Zhao, Tianlu Wang, Mark Yatskar, Ryan Cotterell, Vicente, Ordonez, Kai-Wei Chang

PDF

Open Access 2 Repos

TL;DR

This paper investigates gender bias in ELMo's contextualized word embeddings, revealing data-driven biases and proposing mitigation strategies to reduce such biases in downstream tasks.

Contribution

The study provides a comprehensive analysis of gender bias in ELMo embeddings and introduces methods to effectively mitigate this bias.

Findings

01

ELMo training data contains more male than female entities.

02

ELMo embeddings encode gender information systematically.

03

Bias in ELMo affects coreference systems and can be mitigated.

Abstract

In this paper, we quantify, analyze and mitigate gender bias exhibited in ELMo's contextualized word vectors. First, we conduct several intrinsic analyses and find that (1) training data for ELMo contains significantly more male than female entities, (2) the trained ELMo embeddings systematically encode gender information and (3) ELMo unequally encodes gender information about male and female entities. Then, we show that a state-of-the-art coreference system that depends on ELMo inherits its bias and demonstrates significant bias on the WinoBias probing corpus. Finally, we explore two methods to mitigate such gender bias and show that the bias demonstrated on WinoBias can be eliminated.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM · Softmax · ELMo