Gender, Race, and Intersectional Bias in Resume Screening via Language Model Retrieval
Kyra Wilson, Aylin Caliskan

TL;DR
This study investigates biases in large language models used for resume screening, revealing significant racial and gender biases that mirror real-world employment disparities, with implications for fairness in AI hiring tools.
Contribution
It introduces a document retrieval framework to audit LLMs for bias in resume screening and provides empirical evidence of intersectional biases across multiple occupations.
Findings
LLMs favor White-associated names in 85.1% of cases
Female-associated names are favored in only 11.1% of cases
Black males are disadvantaged in up to 100% of cases
Abstract
Artificial intelligence (AI) hiring tools have revolutionized resume screening, and large language models (LLMs) have the potential to do the same. However, given the biases which are embedded within LLMs, it is unclear whether they can be used in this scenario without disadvantaging groups based on their protected attributes. In this work, we investigate the possibilities of using LLMs in a resume screening setting via a document retrieval framework that simulates job candidate selection. Using that framework, we then perform a resume audit study to determine whether a selection of Massive Text Embedding (MTE) models are biased in resume screening scenarios. We simulate this for nine occupations, using a collection of over 500 publicly available resumes and 500 job descriptions. We find that the MTEs are biased, significantly favoring White-associated names in 85.1\% of cases and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods
