Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating   Representative and Affinity Bias in Large Language Models

Abhishek Kumar; Sarfaroz Yunusov; and Ali Emami

arXiv:2405.14555·cs.CL·June 4, 2024

Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models

Abhishek Kumar, Sarfaroz Yunusov, and Ali Emami

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces dual metrics, RBS and ABS, to detect subtle representative and affinity biases in large language models, revealing significant biases and complex human-model bias interactions.

Contribution

The study presents novel metrics and a suite of open-ended tasks to measure and analyze subtle biases in LLMs, advancing bias detection methods.

Findings

01

Marked biases towards white, straight, male identities in LLMs

02

Distinctive bias patterns or 'fingerprints' in model evaluations

03

Human evaluators also exhibit bias patterns similar to models

Abstract

Research on Large Language Models (LLMs) has often neglected subtle biases that, although less apparent, can significantly influence the models' outputs toward particular social narratives. This study addresses two such biases within LLMs: representative bias, which denotes a tendency of LLMs to generate outputs that mirror the experiences of certain identity groups, and affinity bias, reflecting the models' evaluative preferences for specific narratives or viewpoints. We introduce two novel metrics to measure these biases: the Representative Bias Score (RBS) and the Affinity Bias Score (ABS), and present the Creativity-Oriented Generation Suite (CoGS), a collection of open-ended tasks such as short story writing and poetry composition, designed with customized rubrics to detect these subtle biases. Our analysis uncovers marked representative biases in prominent LLMs, with a preference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

akkeshav/subtlebias
none

Videos

Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models· underline

Taxonomy

TopicsComputational and Text Analysis Methods