Which Demographics do LLMs Default to During Annotation?
Johannes Sch\"afer, Aidan Combs, Christopher Bagdon, Jiahui Li, Nadine Probol, Lynn Greschner, Sean Papay, Yarik Menchaca Resendiz, Aswathy Velutharambath, Amelie W\"uhrl, Sabine Weber, Roman Klinger

TL;DR
This paper investigates which demographic attributes large language models (LLMs) implicitly adopt during annotation tasks, revealing notable influences of gender, race, and age that contrast with prior findings.
Contribution
It combines bias analysis and prompt manipulation to identify the demographics LLMs mimic when no explicit demographic information is provided.
Findings
LLMs show biases related to gender, race, and age in annotation tasks.
Demographic prompts significantly influence LLM responses.
Contrasts with previous studies that found no demographic effects.
Abstract
Demographics and cultural background of annotators influence the labels they assign in text annotation -- for instance, an elderly woman might find it offensive to read a message addressed to a "bro", but a male teenager might find it appropriate. It is therefore important to acknowledge label variations to not under-represent members of a society. Two research directions developed out of this observation in the context of using large language models (LLM) for data annotations, namely (1) studying biases and inherent knowledge of LLMs and (2) injecting diversity in the output by manipulating the prompt with demographic information. We combine these two strands of research and ask the question to which demographics an LLM resorts to when no demographics is given. To answer this question, we evaluate which attributes of human annotators LLMs inherently mimic. Furthermore, we compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques
