Exploring Subjectivity for more Human-Centric Assessment of Social   Biases in Large Language Models

Paula Akemi Aoyagui; Sharon Ferguson; Anastasia Kuzminykh

arXiv:2405.11048·cs.HC·May 21, 2024

Exploring Subjectivity for more Human-Centric Assessment of Social Biases in Large Language Models

Paula Akemi Aoyagui, Sharon Ferguson, Anastasia Kuzminykh

PDF

Open Access

TL;DR

This paper emphasizes the importance of incorporating human subjective judgment alongside automated metrics to better detect nuanced and context-dependent social biases in large language models, advancing human-centric evaluation methods.

Contribution

It highlights the limitations of automated bias detection and advocates for human evaluation to capture nuanced biases, proposing a human-centered approach for more effective assessment.

Findings

01

Automated metrics may miss nuanced biases.

02

Human evaluators can identify context-dependent biases.

03

A combined approach improves bias detection accuracy.

Abstract

An essential aspect of evaluating Large Language Models (LLMs) is identifying potential biases. This is especially relevant considering the substantial evidence that LLMs can replicate human social biases in their text outputs and further influence stakeholders, potentially amplifying harm to already marginalized individuals and communities. Therefore, recent efforts in bias detection invested in automated benchmarks and objective metrics such as accuracy (i.e., an LLMs output is compared against a predefined ground truth). Nonetheless, social biases can be nuanced, oftentimes subjective and context-dependent, where a situation is open to interpretation and there is no ground truth. While these situations can be difficult for automated evaluation systems to identify, human evaluators could potentially pick up on these nuances. In this paper, we discuss the role of human evaluation and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Mental Health Research Topics · Mental Health via Writing