Evaluating LLMs for Gender Disparities in Notable Persons
Lauren Rhue, Sofie Goethals, Arun Sundararajan

TL;DR
This paper investigates gender biases in Large Language Models, revealing persistent disparities in responses related to gender, despite improvements in newer models, and explores their origins in prompt associations and response patterns.
Contribution
It provides a comprehensive evaluation of gender disparities in GPT models, highlighting specific biases and analyzing their sources in prompt and response characteristics.
Findings
Gender disparities persist in GPT-3.5 responses.
GPT-4 shows improvements but still exhibits biases.
Biases are linked to prompt gender associations and response homogeneity.
Abstract
This study examines the use of Large Language Models (LLMs) for retrieving factual information, addressing concerns over their propensity to produce factually incorrect "hallucinated" responses or to altogether decline to even answer prompt at all. Specifically, it investigates the presence of gender-based biases in LLMs' responses to factual inquiries. This paper takes a multi-pronged approach to evaluating GPT models by evaluating fairness across multiple dimensions of recall, hallucinations and declinations. Our findings reveal discernible gender disparities in the responses generated by GPT-3.5. While advancements in GPT-4 have led to improvements in performance, they have not fully eradicated these gender disparities, notably in instances where responses are declined. The study further explores the origins of these disparities by examining the influence of gender associations in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Label Smoothing · Transformer · Residual Connection · Weight Decay
