A Taxonomy of Stereotype Content in Large Language Models

Gandalf Nicolas; Aylin Caliskan

arXiv:2408.00162·cs.CY·August 2, 2024·1 cites

A Taxonomy of Stereotype Content in Large Language Models

Gandalf Nicolas, Aylin Caliskan

PDF

Open Access

TL;DR

This paper develops a detailed taxonomy of stereotypes in large language models, revealing that LLMs reflect high-dimensional human stereotypes across multiple social categories and dimensions, with implications for AI bias mitigation.

Contribution

It introduces a comprehensive multidimensional taxonomy of LLM stereotypes and demonstrates its effectiveness in characterizing biases across various social categories.

Findings

01

14 stereotype dimensions identified, covering ~90% of associations

02

Warmth and Competence are the most prevalent stereotype facets

03

LLMs exhibit more positive stereotypes compared to humans

Abstract

This study introduces a taxonomy of stereotype content in contemporary large language models (LLMs). We prompt ChatGPT 3.5, Llama 3, and Mixtral 8x7B, three powerful and widely used LLMs, for the characteristics associated with 87 social categories (e.g., gender, race, occupations). We identify 14 stereotype dimensions (e.g., Morality, Ability, Health, Beliefs, Emotions), accounting for ~90% of LLM stereotype associations. Warmth and Competence facets were the most frequent content, but all other dimensions were significantly prevalent. Stereotypes were more positive in LLMs (vs. humans), but there was significant variability across categories and dimensions. Finally, the taxonomy predicted the LLMs' internal evaluations of social categories (e.g., how positively/negatively the categories were represented), supporting the relevance of a multidimensional taxonomy for characterizing LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Computational and Text Analysis Methods · Migration, Refugees, and Integration

MethodsLLaMA