Surfacing Subtle Stereotypes: A Multilingual, Debate-Oriented Evaluation of Modern LLMs

Muhammed Saeed; Muhammad Abdul-mageed; Shady Shehata

arXiv:2511.01187·cs.CL·March 31, 2026

Surfacing Subtle Stereotypes: A Multilingual, Debate-Oriented Evaluation of Modern LLMs

Muhammed Saeed, Muhammad Abdul-mageed, Shady Shehata

PDF

TL;DR

This paper introduces a multilingual debate-style benchmark to evaluate how large language models reproduce stereotypes across different languages and sensitive domains, revealing biases that persist despite safety efforts.

Contribution

It presents extsc{DebateBias}, a new multilingual benchmark for assessing stereotype emergence in LLMs, highlighting biases in low-resource languages and across diverse cultural contexts.

Findings

01

Models reproduce entrenched stereotypes despite safety measures.

02

Biases are more pronounced in low-resource languages.

03

Current alignment methods do not fully mitigate biases in open-ended generation.

Abstract

Large language models (LLMs) are widely deployed for open-ended communication, yet most bias evaluations still rely on English, classification-style tasks. We introduce \corpusname, a new multilingual, debate-style benchmark designed to reveal how narrative bias appears in realistic generative settings. Our dataset includes 8{,}400 structured debate prompts spanning four sensitive domains -- Women's Rights, Backwardness, Terrorism, and Religion -- across seven languages ranging from high-resource (English, Chinese) to low-resource (Swahili, Nigerian Pidgin). Using four flagship models (GPT-4o, Claude~3.5~Haiku, DeepSeek-Chat, and LLaMA-3-70B), we generate over 100{,}000 debate responses and automatically classify which demographic groups are assigned stereotyped versus modern roles. Results show that all models reproduce entrenched stereotypes despite safety alignment: Arabs are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.