Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation

Kristian Lum; Jacy Reese Anthis; Kevin Robinson; Chirag Nagpal; Alexander D'Amour

arXiv:2402.12649·cs.CL·June 6, 2025·2 cites

Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation

Kristian Lum, Jacy Reese Anthis, Kevin Robinson, Chirag Nagpal, Alexander D'Amour

PDF

Open Access

TL;DR

This paper critiques current bias benchmarks in large language models, demonstrating they lack robustness in realistic, long-form, context-specific scenarios, and advocates for context-grounded bias evaluations.

Contribution

It introduces RUTEd evaluations for bias in LLMs, showing standard metrics do not reliably predict biases in realistic, long-form applications.

Findings

01

Standard bias metrics do not correlate with realistic bias measures.

02

Current benchmarks are unreliable proxies for real-world AI biases.

03

Context-specific evaluations reveal biases not captured by traditional tests.

Abstract

Standard benchmarks of bias and fairness in large language models (LLMs) measure the association between the user attributes stated or implied by a prompt and the LLM's short text response, but human-AI interaction increasingly requires long-form and context-specific system output to solve real-world tasks. In the commonly studied domain of gender-occupation bias, we test whether these benchmarks are robust to lengthening the LLM responses as a measure of Realistic Use and Tangible Effects (i.e., RUTEd evaluations). From the current literature, we adapt three standard bias metrics (neutrality, skew, and stereotype) and develop analogous RUTEd evaluations from three contexts of real-world use: children's bedtime stories, user personas, and English language learning exercises. We find that standard bias metrics have no significant correlation with the more realistic bias metrics. For…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques