Gender, Race, and Intersectional Bias in Resume Screening via Language   Model Retrieval

Kyra Wilson; Aylin Caliskan

arXiv:2407.20371·cs.CY·August 22, 2024·3 cites

Gender, Race, and Intersectional Bias in Resume Screening via Language Model Retrieval

Kyra Wilson, Aylin Caliskan

PDF

Open Access

TL;DR

This study investigates biases in large language models used for resume screening, revealing significant racial and gender biases that mirror real-world employment disparities, with implications for fairness in AI hiring tools.

Contribution

It introduces a document retrieval framework to audit LLMs for bias in resume screening and provides empirical evidence of intersectional biases across multiple occupations.

Findings

01

LLMs favor White-associated names in 85.1% of cases

02

Female-associated names are favored in only 11.1% of cases

03

Black males are disadvantaged in up to 100% of cases

Abstract

Artificial intelligence (AI) hiring tools have revolutionized resume screening, and large language models (LLMs) have the potential to do the same. However, given the biases which are embedded within LLMs, it is unclear whether they can be used in this scenario without disadvantaging groups based on their protected attributes. In this work, we investigate the possibilities of using LLMs in a resume screening setting via a document retrieval framework that simulates job candidate selection. Using that framework, we then perform a resume audit study to determine whether a selection of Massive Text Embedding (MTE) models are biased in resume screening scenarios. We simulate this for nine occupations, using a collection of over 500 publicly available resumes and 500 job descriptions. We find that the MTEs are biased, significantly favoring White-associated names in 85.1\% of cases and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods