Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
Abhishek Kumar, Sarfaroz Yunusov, and Ali Emami

TL;DR
This paper introduces dual metrics, RBS and ABS, to detect subtle representative and affinity biases in large language models, revealing significant biases and complex human-model bias interactions.
Contribution
The study presents novel metrics and a suite of open-ended tasks to measure and analyze subtle biases in LLMs, advancing bias detection methods.
Findings
Marked biases towards white, straight, male identities in LLMs
Distinctive bias patterns or 'fingerprints' in model evaluations
Human evaluators also exhibit bias patterns similar to models
Abstract
Research on Large Language Models (LLMs) has often neglected subtle biases that, although less apparent, can significantly influence the models' outputs toward particular social narratives. This study addresses two such biases within LLMs: representative bias, which denotes a tendency of LLMs to generate outputs that mirror the experiences of certain identity groups, and affinity bias, reflecting the models' evaluative preferences for specific narratives or viewpoints. We introduce two novel metrics to measure these biases: the Representative Bias Score (RBS) and the Affinity Bias Score (ABS), and present the Creativity-Oriented Generation Suite (CoGS), a collection of open-ended tasks such as short story writing and poetry composition, designed with customized rubrics to detect these subtle biases. Our analysis uncovers marked representative biases in prominent LLMs, with a preference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsComputational and Text Analysis Methods
