LLM-GLOBE: A Benchmark Evaluating the Cultural Values Embedded in LLM Output
Elise Karinshak, Amanda Hu, Kewen Kong, Vishwanatha Rao, Jingren Wang, Jindong Wang, Yi Zeng

TL;DR
This paper introduces LLM-GLOBE, a benchmark for assessing the cultural values embedded in large language models, comparing Chinese and US models, and highlighting the importance of cultural alignment in AI development.
Contribution
It proposes a novel benchmark based on cultural psychology and a unique 'LLMs-as-a-Jury' evaluation pipeline for large-scale cultural analysis of LLMs.
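As a rough illustration of how a jury-style evaluation might aggregate judgments, the sketch below averages ratings from several juror functions for a single open-ended response. It is not the paper's pipeline: the `Juror` type, the `jury_score` function, the 1 to 7 rating range, and the use of a simple mean are all assumptions made for this example. In practice each juror would presumably wrap a call to a different LLM.

```python
from statistics import mean
from typing import Callable, Sequence

# Hypothetical juror type: takes a model response, returns a numeric rating.
# In a real pipeline each juror would wrap a call to a separate LLM.
Juror = Callable[[str], float]


def jury_score(
    response: str,
    jurors: Sequence[Juror],
    low: float = 1.0,   # assumed lower bound of the rating scale
    high: float = 7.0,  # assumed upper bound of the rating scale
) -> float:
    """Return the mean rating across jurors, rejecting out-of-range votes."""
    if not jurors:
        raise ValueError("at least one juror is required")
    votes = [juror(response) for juror in jurors]
    for vote in votes:
        if not low <= vote <= high:
            raise ValueError(f"rating {vote} is outside [{low}, {high}]")
    return mean(votes)


# Stub jurors with fixed ratings stand in for real LLM calls.
stub_jurors = [lambda r: 4.0, lambda r: 5.0, lambda r: 6.0]
score = jury_score("An example open-ended answer.", stub_jurors)
```

Averaging is only one possible aggregation rule; a median or majority vote would be equally plausible under the same assumptions.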
Findings
Identifies cultural similarities and differences between Chinese and US LLMs.
Highlights open-generation tasks as effective for cultural value evaluation.
Provides insights for AI cultural alignment and human-AI collaboration.
Abstract
Immense effort has been dedicated to minimizing the presence of harmful or biased generative content and better aligning AI output to human intention; however, research investigating the cultural values of LLMs is still in very early stages. Cultural values underpin how societies operate, providing profound insights into the norms, priorities, and decision making of their members. In recognition of this need for further research, we draw upon cultural psychology theory and the empirically-validated GLOBE framework to propose the LLM-GLOBE benchmark for evaluating the cultural value systems of LLMs, and we then leverage the benchmark to compare the values of Chinese and US LLMs. Our methodology includes a novel "LLMs-as-a-Jury" pipeline which automates the evaluation of open-ended content to enable large-scale analysis at a conceptual level. Results clarify similarities and differences…
Taxonomy
Topics: Library Science and Information Systems
