Probing Pre-Trained Language Models for Cross-Cultural Differences in Values

Arnav Arora; Lucie-Aimée Kaffee; Isabelle Augenstein

arXiv:2203.13722 · cs.CL · August 29, 2025 · 6 cites

TL;DR

This paper investigates how pre-trained language models encode cultural values. The models capture differences in values across cultures, but these differences align only weakly with established value surveys, which raises concerns about using such models in cross-cultural settings and motivates aligning them with the surveys.

Contribution

Introduces probes to analyze which cultural values are embedded in pre-trained language models, and compares the results with existing theories and cross-cultural value surveys.

Findings

01. PTLMs encode cross-cultural value differences

02. Weak alignment between model embeddings and surveys

03. Implications for using PTLMs in multicultural contexts

Abstract

Language embeds information about the social, cultural, and political values people hold. Prior work has explored social and potentially harmful biases encoded in pre-trained language models (PTLMs). However, there has been no systematic study of how the values embedded in these models vary across cultures. In this paper, we introduce probes to study which values across cultures are embedded in these models, and whether they align with existing theories and cross-cultural value surveys. We find that PTLMs capture differences in values across cultures, but that these differences align only weakly with established value surveys. We discuss the implications of using misaligned models in cross-cultural settings, as well as ways of aligning PTLMs with value surveys.

Code & Models

Repositories

copenlu/value-probing (official)

Taxonomy

Topics: Educator Training and Historical Pedagogy · Computational and Text Analysis Methods · Social Media and Politics

Methods: ALIGN