Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models
Hannah Kirk, Yennie Jun, Haider Iqbal, Elias Benussi, Filippo Volpin,, Frederic A. Dreyer, Aleksandar Shtedritski, Yuki M. Asano

TL;DR
This paper empirically analyzes intersectional occupational biases in GPT-2, revealing stereotypical associations and societal biases reflected in the model's generated text, raising questions about normative learning.
Contribution
It provides a detailed intersectional bias analysis of GPT-2's occupational associations, highlighting the influence of societal biases and the model's reflection or correction of real-world inequalities.
Findings
GPT-2's job predictions are less diverse and more stereotypical for women.
Intersectional interactions significantly influence occupational associations.
GPT-2 often mirrors societal gender and ethnicity distributions, sometimes correcting for biases.
Abstract
The capabilities of natural language models trained on large-scale data have increased immensely over the past few years. Open source libraries such as HuggingFace have made these models easily available and accessible. While prior research has identified biases in large language models, this paper considers biases contained in the most popular versions of these models when applied `out-of-the-box' for downstream tasks. We focus on generative language models as they are well-suited for extracting biases inherited from training data. Specifically, we conduct an in-depth analysis of GPT-2, which is the most downloaded text generation model on HuggingFace, with over half a million downloads per month. We assess biases related to occupational associations for different protected categories by intersecting gender with religion, sexuality, ethnicity, political affiliation, and continental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsComputational and Text Analysis Methods · Topic Modeling
MethodsLinear Layer · Cosine Annealing · Layer Normalization · Residual Connection · Attention Dropout · Discriminative Fine-Tuning · Multi-Head Attention · Adam · Linear Warmup With Cosine Annealing · Weight Decay
