Towards Geo-Culturally Grounded LLM Generations

Piyawat Lertvittayakumjorn; David Kinney; Vinodkumar Prabhakaran; Donald Martin Jr.; Sunipa Dev

arXiv:2502.13497·cs.CL·July 17, 2025

Towards Geo-Culturally Grounded LLM Generations

Piyawat Lertvittayakumjorn, David Kinney, Vinodkumar Prabhakaran, Donald Martin Jr., Sunipa Dev

PDF

Open Access 1 Video

TL;DR

This paper explores how retrieval-augmented generation techniques, especially search grounding, can enhance large language models' cultural knowledge, while also discussing associated risks like stereotypes and limitations of knowledge bases.

Contribution

It provides a comparative analysis of KB and search grounding methods, highlighting the effectiveness and challenges of search grounding in improving cultural awareness in LLMs.

Findings

01

Search grounding improves propositional cultural knowledge performance.

02

KB grounding is limited by knowledge base coverage.

03

Search grounding increases stereotypical judgments.

Abstract

Generative large language models (LLMs) have demonstrated gaps in diverse cultural awareness across the globe. We investigate the effect of retrieval augmented generation and search-grounding techniques on LLMs' ability to display familiarity with various national cultures. Specifically, we compare the performance of standard LLMs, LLMs augmented with retrievals from a bespoke knowledge base (i.e., KB grounding), and LLMs augmented with retrievals from a web search (i.e., search grounding) on multiple cultural awareness benchmarks. We find that search grounding significantly improves the LLM performance on multiple-choice benchmarks that test propositional knowledge (e.g., cultural norms, artifacts, and institutions), while KB grounding's effectiveness is limited by inadequate knowledge base coverage and a suboptimal retriever. However, search grounding also increases the risk of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards Geo-Culturally Grounded LLM Generations· underline

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies

MethodsBalanced Selection