AdaCultureSafe: Adaptive Cultural Safety Grounded by Cultural Knowledge in Large Language Models
Hankun Kang, Di Lin, Zhirong Liao, Pengfei Bai, Xinyi Zeng, Jiawei Jiang, Yuanyuan Zhu, Tieyun Qian

TL;DR
This paper introduces AdaCultureSafe, a dataset and framework for improving cultural safety in LLMs by grounding responses in authoritative cultural knowledge, addressing challenges of cultural diversity and subtlety.
Contribution
It presents a novel dataset and a knowledge-grounded approach to enhance cultural safety in LLMs, bridging the gap between cultural knowledge and safety.
Findings
No significant correlation between cultural safety and knowledge proficiency in LLMs.
The proposed method significantly improves cultural safety in LLM responses.
A new dataset with 4.8K cultural descriptions and 48K queries was created.
Abstract
With the widespread adoption of Large Language Models (LLMs), respecting indigenous cultures becomes essential for models' culturally safety and responsible global applications. Existing studies separately consider cultural safety and cultural knowledge and neglect that the former should be grounded by the latter. This severely prevents LLMs from yielding culture-specific respectful responses. Consequently, adaptive cultural safety remains a formidable task. In this work, we propose to jointly model cultural safety and knowledge. First and foremost, cultural-safety and knowledge-paired data serve as the key prerequisite to conduct this research. However, the cultural diversity across regions and the subtlety of cultural differences pose significant challenges to the creation of such paired evaluation data. To address this issue, we propose a novel framework that integrates authoritative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Digital Economy · Computational and Text Analysis Methods · Language and cultural evolution
