Cross-Language Bias Examination in Large Language Models

Yuxuan Liang; Marwa Mahmoud

arXiv:2512.16029·cs.CY·December 19, 2025

Open Access

TL;DR

This paper presents a new framework for evaluating bias in multilingual large language models, revealing significant cross-lingual bias variations and emphasizing the importance of implicit bias detection.

Contribution

The paper introduces a multilingual bias evaluation framework that combines explicit and implicit bias assessments across five languages (English, Chinese, Arabic, French, and Spanish), filling a key research gap.

Findings

01. Arabic and Spanish show higher stereotype bias

02. Chinese and English exhibit lower bias levels

03. Implicit bias is often higher than explicit bias in age-related assessments

Abstract

This study introduces an innovative multilingual bias evaluation framework for assessing bias in Large Language Models, combining explicit bias assessment through the BBQ benchmark with implicit bias measurement using a prompt-based Implicit Association Test. By translating the prompts and word list into five target languages, English, Chinese, Arabic, French, and Spanish, we directly compare different types of bias across languages. The results reveal substantial gaps in bias across languages used in LLMs. For example, Arabic and Spanish consistently show higher levels of stereotype bias, while Chinese and English exhibit lower levels of bias. We also identify contrasting patterns across bias types. Age shows the lowest explicit bias but the highest implicit bias, emphasizing the importance of detecting implicit biases that are undetectable with standard benchmarks. These findings…
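To make the explicit-bias side of the comparison concrete, the sketch below computes a per-language bias score from labeled model answers. It assumes the score defined in the original BBQ benchmark for disambiguated contexts, 2 * (biased answers / non-"unknown" answers) - 1; the abstract does not state which scoring the authors used, so this is an illustration rather than their method. The function names, the answer labels, and the toy records are all hypothetical, and the numbers are not results from the paper.

```python
from collections import defaultdict


def bbq_bias_score(labels):
    """Bias score in [-1, 1] for a list of answer labels.

    Each label is "biased", "counter", or "unknown" (hypothetical names).
    Assumed formula: 2 * (n_biased / n_non_unknown) - 1.
    Returns 0.0 when every answer is "unknown".
    """
    non_unknown = [label for label in labels if label != "unknown"]
    if not non_unknown:
        return 0.0
    n_biased = sum(1 for label in non_unknown if label == "biased")
    return 2 * n_biased / len(non_unknown) - 1


def scores_by_language(records):
    """Group (language, label) pairs by language and score each group."""
    grouped = defaultdict(list)
    for language, label in records:
        grouped[language].append(label)
    return {language: bbq_bias_score(labels) for language, labels in grouped.items()}


# Toy records with placeholder language names; made up for illustration only.
toy_records = [
    ("lang_a", "biased"), ("lang_a", "counter"), ("lang_a", "unknown"),
    ("lang_b", "biased"), ("lang_b", "biased"),
    ("lang_b", "counter"), ("lang_b", "biased"),
]

if __name__ == "__main__":
    for language, score in scores_by_language(toy_records).items():
        print(f"{language}: {score:+.2f}")
```

A score near 0 means stereotype-consistent and stereotype-inconsistent answers are balanced, while a positive score means the model leans toward the stereotyped answer. Scoring each language with the same function is what makes the cross-language comparison direct.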

Taxonomy

Topics: Computational and Text Analysis Methods · Ethics and Social Impacts of AI · Authorship Attribution and Profiling