LLM-GLOBE: A Benchmark Evaluating the Cultural Values Embedded in LLM Output

Elise Karinshak; Amanda Hu; Kewen Kong; Vishwanatha Rao; Jingren Wang; Jindong Wang; Yi Zeng

arXiv:2411.06032·cs.CL·November 12, 2024·3 cites

TL;DR

This paper introduces LLM-GLOBE, a benchmark for assessing the cultural values embedded in large language models, comparing Chinese and US models, and highlighting the importance of cultural alignment in AI development.

Contribution

It proposes a novel benchmark based on cultural psychology and a unique 'LLMs-as-a-Jury' evaluation pipeline for large-scale cultural analysis of LLMs.

Findings

01. Identifies cultural similarities and differences between Chinese and US LLMs.

02. Highlights open-generation tasks as effective for cultural value evaluation.

03. Provides insights for AI cultural alignment and human-AI collaboration.

Abstract

Immense effort has been dedicated to minimizing the presence of harmful or biased generative content and better aligning AI output to human intention; however, research investigating the cultural values of LLMs is still in very early stages. Cultural values underpin how societies operate, providing profound insights into the norms, priorities, and decision making of their members. In recognition of this need for further research, we draw upon cultural psychology theory and the empirically-validated GLOBE framework to propose the LLM-GLOBE benchmark for evaluating the cultural value systems of LLMs, and we then leverage the benchmark to compare the values of Chinese and US LLMs. Our methodology includes a novel "LLMs-as-a-Jury" pipeline which automates the evaluation of open-ended content to enable large-scale analysis at a conceptual level. Results clarify similarities and differences…
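
The abstract does not spell out how the "LLMs-as-a-Jury" pipeline works, so the following is only a minimal sketch of the general idea: several judge models each rate an open-ended response, and the ratings are combined into one score. Everything here is an assumption made for illustration: the 1-7 rating scale, averaging as the aggregation rule, the `jury_score` function, and the stand-in jurors, which would be LLM API calls in practice. It should not be read as the authors' implementation.

```python
from statistics import mean
from typing import Callable, Sequence

# Hypothetical juror type: maps (prompt, response) to a rating.
# The 1-7 scale is an assumption for this sketch.
Juror = Callable[[str, str], int]


def jury_score(prompt: str, response: str, jurors: Sequence[Juror]) -> float:
    """Average the ratings that each juror assigns to one open-ended response."""
    if not jurors:
        raise ValueError("need at least one juror")
    scores = []
    for juror in jurors:
        score = juror(prompt, response)
        if not 1 <= score <= 7:
            raise ValueError(f"score {score} is outside the assumed 1-7 scale")
        scores.append(score)
    return mean(scores)


# Stand-in jurors with fixed behavior; real jurors would query LLMs.
def lenient_juror(prompt: str, response: str) -> int:
    return 6


def strict_juror(prompt: str, response: str) -> int:
    return 4


def length_juror(prompt: str, response: str) -> int:
    # Toy rule: rate longer answers higher.
    return 5 if len(response) > 20 else 3


if __name__ == "__main__":
    jurors = [lenient_juror, strict_juror, length_juror]
    answer = "A good leader consults the whole team before deciding."
    print(jury_score("Describe a good leader.", answer, jurors))
```

Averaging is only one possible aggregation rule; a median or majority vote would make the combined score less sensitive to a single outlying juror.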

Code & Models

Repositories

raovish6/LLM-GLOBE (Official)

Taxonomy

Topics: Library Science and Information Systems