Gender bias and stereotypes in Large Language Models
Hadas Kotek, Rikker Dockum, and David Q. Sun

TL;DR
This paper examines gender bias in Large Language Models, revealing they amplify stereotypes, ignore ambiguities, and rationalize biased choices, highlighting the need for careful testing to ensure fairness.
Contribution
It introduces a simple paradigm to test gender bias in LLMs, demonstrating their tendency to reinforce stereotypes and provide inaccurate rationalizations.
Findings
LLMs are 3-6 times more likely to choose stereotypical occupations.
Choices align more with perceptions than ground truth.
LLMs amplify existing gender biases.
Abstract
Large Language Models (LLMs) have made substantial progress in the past several months, shattering state-of-the-art benchmarks in many domains. This paper investigates LLMs' behavior with respect to gender stereotypes, a known issue for prior models. We use a simple paradigm to test the presence of gender bias, building on but differing from WinoBias, a commonly used gender bias dataset, which is likely to be included in the training data of current LLMs. We test four recently published LLMs and demonstrate that they express biased assumptions about men and women's occupations. Our contributions in this paper are as follows: (a) LLMs are 3-6 times more likely to choose an occupation that stereotypically aligns with a person's gender; (b) these choices align with people's perceptions better than with the ground truth as reflected in official job statistics; (c) LLMs in fact amplify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsALIGN
