An Evaluation of Cultural Value Alignment in LLM
Nicholas Sukiennik, Chen Gao, Fengli Xu, Yong Li

TL;DR
This study conducts a large-scale evaluation of how well large language models (LLMs) align with diverse cultural values across 20 countries, revealing biases and proposing metrics for cultural alignment.
Contribution
It introduces the first comprehensive assessment of LLM cultural alignment across multiple countries and models, highlighting biases and factors influencing cultural representation.
Findings
US is the best-aligned country in LLM outputs
GLM-4 shows the highest ability to align with cultural values
Models tend to align more with US culture than Chinese culture
Abstract
LLMs as intelligent agents are being increasingly applied in scenarios where human interactions are involved, leading to a critical concern about whether LLMs are faithful to the variations in culture across regions. Several works have investigated this question in various ways, finding that there are biases present in the cultural representations of LLM outputs. To gain a more comprehensive view, in this work, we conduct the first large-scale evaluation of LLM culture assessing 20 countries' cultures and languages across ten LLMs. With a renowned cultural values questionnaire and by carefully analyzing LLM output with human ground truth scores, we thoroughly study LLMs' cultural alignment across countries and among individual models. Our findings show that the output over all models represents a moderate cultural middle ground. Given the overall skew, we propose an alignment metric,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDispute Resolution and Class Actions
MethodsALIGN
