Evaluating Gender, Racial, and Age Biases in Large Language Models: A Comparative Analysis of Occupational and Crime Scenarios
Vishal Mirza, Rahul Kulkarni, Aakanksha Jadhav

TL;DR
This study evaluates biases in recent large language models across occupational and crime scenarios, revealing persistent disparities and limitations in current bias mitigation techniques affecting fairness and reliability.
Contribution
The paper provides a comparative analysis of bias levels in four leading LLMs, highlighting the effectiveness and shortcomings of existing bias mitigation strategies.
Findings
LLMs show a 37% deviation from US BLS occupational data in gender representation.
Bias in crime scenarios deviates by 54% for gender, 28% for race, and 17% for age from FBI data.
Current bias mitigation efforts can sometimes overcorrect, worsening disparities.
Abstract
Recent advancements in Large Language Models(LLMs) have been notable, yet widespread enterprise adoption remains limited due to various constraints. This paper examines bias in LLMs-a crucial issue affecting their usability, reliability, and fairness. Researchers are developing strategies to mitigate bias, including debiasing layers, specialized reference datasets like Winogender and Winobias, and reinforcement learning with human feedback (RLHF). These techniques have been integrated into the latest LLMs. Our study evaluates gender bias in occupational scenarios and gender, age, and racial bias in crime scenarios across four leading LLMs released in 2024: Gemini 1.5 Pro, Llama 3 70B, Claude 3 Opus, and GPT-4o. Findings reveal that LLMs often depict female characters more frequently than male ones in various occupations, showing a 37% deviation from US BLS data. In crime scenarios,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Gender Studies in Language
MethodsLLaMA
