Bias Out-of-the-Box: An Empirical Analysis of Intersectional   Occupational Biases in Popular Generative Language Models

Hannah Kirk; Yennie Jun; Haider Iqbal; Elias Benussi; Filippo Volpin,; Frederic A. Dreyer; Aleksandar Shtedritski; Yuki M. Asano

arXiv:2102.04130·cs.CL·October 29, 2021·26 cites

Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models

Hannah Kirk, Yennie Jun, Haider Iqbal, Elias Benussi, Filippo Volpin,, Frederic A. Dreyer, Aleksandar Shtedritski, Yuki M. Asano

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper empirically analyzes intersectional occupational biases in GPT-2, revealing stereotypical associations and societal biases reflected in the model's generated text, raising questions about normative learning.

Contribution

It provides a detailed intersectional bias analysis of GPT-2's occupational associations, highlighting the influence of societal biases and the model's reflection or correction of real-world inequalities.

Findings

01

GPT-2's job predictions are less diverse and more stereotypical for women.

02

Intersectional interactions significantly influence occupational associations.

03

GPT-2 often mirrors societal gender and ethnicity distributions, sometimes correcting for biases.

Abstract

The capabilities of natural language models trained on large-scale data have increased immensely over the past few years. Open source libraries such as HuggingFace have made these models easily available and accessible. While prior research has identified biases in large language models, this paper considers biases contained in the most popular versions of these models when applied `out-of-the-box' for downstream tasks. We focus on generative language models as they are well-suited for extracting biases inherited from training data. Specifically, we conduct an in-depth analysis of GPT-2, which is the most downloaded text generation model on HuggingFace, with over half a million downloads per month. We assess biases related to occupational associations for different protected categories by intersecting gender with religion, sexuality, ethnicity, political affiliation, and continental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oxai/intersectional_gpt2
noneOfficial

Videos

Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models· slideslive

Taxonomy

TopicsComputational and Text Analysis Methods · Topic Modeling

MethodsLinear Layer · Cosine Annealing · Layer Normalization · Residual Connection · Attention Dropout · Discriminative Fine-Tuning · Multi-Head Attention · Adam · Linear Warmup With Cosine Annealing · Weight Decay