A Taxonomy of Stereotype Content in Large Language Models
Gandalf Nicolas, Aylin Caliskan

TL;DR
This paper develops a detailed taxonomy of stereotypes in large language models, revealing that LLMs reflect high-dimensional human stereotypes across multiple social categories and dimensions, with implications for AI bias mitigation.
Contribution
It introduces a comprehensive multidimensional taxonomy of LLM stereotypes and demonstrates its effectiveness in characterizing biases across various social categories.
Findings
14 stereotype dimensions identified, covering ~90% of associations
Warmth and Competence are the most prevalent stereotype facets
LLMs exhibit more positive stereotypes compared to humans
Abstract
This study introduces a taxonomy of stereotype content in contemporary large language models (LLMs). We prompt ChatGPT 3.5, Llama 3, and Mixtral 8x7B, three powerful and widely used LLMs, for the characteristics associated with 87 social categories (e.g., gender, race, occupations). We identify 14 stereotype dimensions (e.g., Morality, Ability, Health, Beliefs, Emotions), accounting for ~90% of LLM stereotype associations. Warmth and Competence facets were the most frequent content, but all other dimensions were significantly prevalent. Stereotypes were more positive in LLMs (vs. humans), but there was significant variability across categories and dimensions. Finally, the taxonomy predicted the LLMs' internal evaluations of social categories (e.g., how positively/negatively the categories were represented), supporting the relevance of a multidimensional taxonomy for characterizing LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Computational and Text Analysis Methods · Migration, Refugees, and Integration
MethodsLLaMA
