# Identifying and Reducing Gender Bias in Word-Level Language Models

**Authors:** Shikha Bordia, Samuel R. Bowman

arXiv: 1904.03035 · 2019-04-08

## TL;DR

This paper introduces a metric for measuring gender bias in text and language models, proposes a regularization method to reduce bias, and demonstrates its effectiveness across multiple datasets.

## Contribution

It presents a novel regularization loss to minimize gender bias in word-level language models and evaluates its effectiveness on various corpora.

## Key findings

- Regularization reduces gender bias effectively.
- Optimal weight balances bias reduction and model stability.
- Method generalizes across different datasets.

## Abstract

Many text corpora exhibit socially problematic biases, which can be propagated or amplified in the models trained on such data. For example, doctor cooccurs more frequently with male pronouns than female pronouns. In this study we (i) propose a metric to measure gender bias; (ii) measure bias in a text corpus and the text generated from a recurrent neural network language model trained on the text corpus; (iii) propose a regularization loss term for the language model that minimizes the projection of encoder-trained embeddings onto an embedding subspace that encodes gender; (iv) finally, evaluate efficacy of our proposed method on reducing gender bias. We find this regularization method to be effective in reducing gender bias up to an optimal weight assigned to the loss term, beyond which the model becomes unstable as the perplexity increases. We replicate this study on three training corpora---Penn Treebank, WikiText-2, and CNN/Daily Mail---resulting in similar conclusions.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.03035/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/1904.03035/full.md

## References

19 references — full list in the complete paper: https://tomesphere.com/paper/1904.03035/full.md

---
Source: https://tomesphere.com/paper/1904.03035