NormAd: A Framework for Measuring the Cultural Adaptability of Large Language Models

Abhinav Rao; Akhila Yerukola; Vishwa Shah; Katharina Reinecke; Maarten Sap

arXiv:2404.12464·cs.CL·July 10, 2025·1 cites

NormAd: A Framework for Measuring the Cultural Adaptability of Large Language Models

Abhinav Rao, Akhila Yerukola, Vishwa Shah, Katharina Reinecke, Maarten Sap

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

This paper introduces NormAd, a framework and benchmark for evaluating large language models' ability to judge social acceptability across diverse cultural norms, revealing current limitations in socio-cultural reasoning.

Contribution

We propose NormAd, a novel evaluation framework and benchmark to assess LLMs' cultural adaptability, highlighting their struggles with social norms across different cultures.

Findings

01

LLMs perform poorly in judging social acceptability across cultures.

02

Models are more accurate in recognizing acceptable vs. unacceptable situations.

03

Performance drops significantly with abstract norms and less cultural context.

Abstract

To be effectively and safely deployed to global user populations, large language models (LLMs) may need to adapt outputs to user values and cultures, not just know about them. We introduce NormAd, an evaluation framework to assess LLMs' cultural adaptability, specifically measuring their ability to judge social acceptability across varying levels of cultural norm specificity, from abstract values to explicit social norms. As an instantiation of our framework, we create NormAd-Eti, a benchmark of 2.6k situational descriptions representing social-etiquette related cultural norms from 75 countries. Through comprehensive experiments on NormAd-Eti, we find that LLMs struggle to accurately judge social acceptability across these varying degrees of cultural contexts and show stronger adaptability to English-centric cultures over those from the Global South. Even in the simplest setting where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

akhila-yerukola/normad
noneOfficial

Datasets

akhilayerukola/NormAd
dataset· 939 dl
939 dl

Videos

NormAd: A Framework for Measuring the Cultural Adaptability of Large Language Models· underline

Taxonomy

TopicsNatural Language Processing Techniques