Risks of Cultural Erasure in Large Language Models
Rida Qadri, Aida M. Davani, Kevin Robinson, Vinodkumar Prabhakaran

TL;DR
This paper examines the risks of cultural erasure in large language models, emphasizing the need for sociologically-aware benchmarks to evaluate how models represent and potentially distort global cultures, especially under-represented ones.
Contribution
It introduces a framework for evaluating cultural omission and simplification in language models and proposes benchmarks to assess their cross-cultural impacts.
Findings
Models often omit certain cultures from descriptions.
Cultural simplification leads to one-dimensional representations.
Proposed benchmarks help evaluate socio-cultural impacts.
Abstract
Large language models are increasingly being integrated into applications that shape the production and discovery of societal knowledge such as search, online education, and travel planning. As a result, language models will shape how people learn about, perceive and interact with global cultures making it important to consider whose knowledge systems and perspectives are represented in models. Recognizing this importance, increasingly work in Machine Learning and NLP has focused on evaluating gaps in global cultural representational distribution within outputs. However, more work is needed on developing benchmarks for cross-cultural impacts of language models that stem from a nuanced sociologically-aware conceptualization of cultural impact or harm. We join this line of work arguing for the need of metricizable evaluations of language technologies that interrogate and account for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Computational and Text Analysis Methods
MethodsEmirates Airlines Office in Dubai · Sparse Evolutionary Training · Focus
