Measuring Gender and Racial Biases in Large Language Models
Jiafu An, Difang Huang, Chen Lin, Mingzhu Tai

TL;DR
This study evaluates gender and racial biases in OpenAI's GPT when used for high-stakes resume screening, revealing biases that could impact fairness in AI-driven hiring decisions and highlighting the need for bias mitigation strategies.
Contribution
It provides a large-scale empirical analysis of social biases in GPT in a realistic decision-making scenario, identifying specific patterns and their potential impact on fairness.
Findings
GPT awards higher scores to female candidates with similar qualifications.
GPT scores lower for black male candidates with similar qualifications.
Biases could influence hiring probabilities by 1-2 percentage points.
Abstract
In traditional decision making processes, social biases of human decision makers can lead to unequal economic outcomes for underrepresented social groups, such as women, racial or ethnic minorities. Recently, the increasing popularity of Large language model based artificial intelligence suggests a potential transition from human to AI based decision making. How would this impact the distributional outcomes across social groups? Here we investigate the gender and racial biases of OpenAIs GPT, a widely used LLM, in a high stakes decision making setting, specifically assessing entry level job candidates from diverse social groups. Instructing GPT to score approximately 361000 resumes with randomized social identities, we find that the LLM awards higher assessment scores for female candidates with similar work experience, education, and skills, while lower scores for black male candidates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods
MethodsAttention Is All You Need · Linear Layer · Attention Dropout · Residual Connection · Cosine Annealing · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Linear Warmup With Cosine Annealing · Softmax · Discriminative Fine-Tuning
