Investigating Intersectional Bias in Large Language Models using Confidence Disparities in Coreference Resolution

Falaah Arif Khan; Nivedha Sivakumar; Yinong Oliver Wang; Katherine Metcalf; Cezanne Camacho; Barry-John Theobald; Luca Zappella; Nicholas Apostoloff

arXiv:2508.07111·cs.CL·August 12, 2025

Investigating Intersectional Bias in Large Language Models using Confidence Disparities in Coreference Resolution

Falaah Arif Khan, Nivedha Sivakumar, Yinong Oliver Wang, Katherine Metcalf, Cezanne Camacho, Barry-John Theobald, Luca Zappella, Nicholas Apostoloff

PDF

Open Access

TL;DR

This paper introduces a new benchmark and metric to evaluate intersectional bias in large language models, revealing significant confidence disparities across multiple demographic intersections and highlighting concerns about model fairness and reasoning.

Contribution

It extends bias evaluation to intersectional identities, creates the WinoIdentity benchmark, and proposes the Coreference Confidence Disparity metric for assessing fairness in LLMs.

Findings

01

Confidence disparities up to 40% across demographic attributes.

02

Models are most uncertain about doubly-disadvantaged identities.

03

Confidence decreases even for privileged markers, indicating memorization over reasoning.

Abstract

Large language models (LLMs) have achieved impressive performance, leading to their widespread adoption as decision-support tools in resource-constrained contexts like hiring and admissions. There is, however, scientific consensus that AI systems can reflect and exacerbate societal biases, raising concerns about identity-based harm when used in critical social contexts. Prior work has laid a solid foundation for assessing bias in LLMs by evaluating demographic disparities in different language reasoning tasks. In this work, we extend single-axis fairness evaluations to examine intersectional bias, recognizing that when multiple axes of discrimination intersect, they create distinct patterns of disadvantage. We create a new benchmark called WinoIdentity by augmenting the WinoBias dataset with 25 demographic markers across 10 attributes, including age, nationality, and race, intersected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Artificial Intelligence in Healthcare and Education · Computational and Text Analysis Methods