An Investigation into Value Misalignment in LLM-Generated Texts for Cultural Heritage
Fan Bu, Zheng Wang, Siyi Wang, Ziyao Liu

TL;DR
This study systematically evaluates the cultural value alignment of open-source LLMs in heritage tasks, revealing high misalignment rates and providing a benchmark and workflow for future improvements.
Contribution
It introduces a comprehensive evaluation framework, a benchmark dataset, and analysis methods for assessing cultural value misalignments in LLM-generated texts.
Findings
Over 65% of generated texts show cultural misalignments.
Certain tasks have nearly complete cultural value misalignment.
The paper provides a new benchmark dataset and evaluation workflow.
Abstract
As Large Language Models (LLMs) become increasingly prevalent in tasks related to cultural heritage, such as generating descriptions of historical monuments, translating ancient texts, preserving oral traditions, and creating educational content, their ability to produce accurate and culturally aligned texts is being increasingly relied upon by users and researchers. However, cultural value misalignments may exist in generated texts, such as the misrepresentation of historical facts, the erosion of cultural identity, and the oversimplification of complex cultural narratives, which may lead to severe consequences. Therefore, investigating value misalignment in the context of LLM for cultural heritage is crucial for mitigating these risks, yet there has been a significant lack of systematic and comprehensive study and investigation in this area. To fill this gap, we systematically assess…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLibrary Science and Information Systems
MethodsSparse Evolutionary Training
