An Investigation into Value Misalignment in LLM-Generated Texts for Cultural Heritage

Fan Bu; Zheng Wang; Siyi Wang; Ziyao Liu

arXiv:2501.02039·cs.CL·August 22, 2025

An Investigation into Value Misalignment in LLM-Generated Texts for Cultural Heritage

Fan Bu, Zheng Wang, Siyi Wang, Ziyao Liu

PDF

Open Access

TL;DR

This study systematically evaluates the cultural value alignment of open-source LLMs in heritage tasks, revealing high misalignment rates and providing a benchmark and workflow for future improvements.

Contribution

It introduces a comprehensive evaluation framework, a benchmark dataset, and analysis methods for assessing cultural value misalignments in LLM-generated texts.

Findings

01

Over 65% of generated texts show cultural misalignments.

02

Certain tasks have nearly complete cultural value misalignment.

03

The paper provides a new benchmark dataset and evaluation workflow.

Abstract

As Large Language Models (LLMs) become increasingly prevalent in tasks related to cultural heritage, such as generating descriptions of historical monuments, translating ancient texts, preserving oral traditions, and creating educational content, their ability to produce accurate and culturally aligned texts is being increasingly relied upon by users and researchers. However, cultural value misalignments may exist in generated texts, such as the misrepresentation of historical facts, the erosion of cultural identity, and the oversimplification of complex cultural narratives, which may lead to severe consequences. Therefore, investigating value misalignment in the context of LLM for cultural heritage is crucial for mitigating these risks, yet there has been a significant lack of systematic and comprehensive study and investigation in this area. To fill this gap, we systematically assess…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLibrary Science and Information Systems

MethodsSparse Evolutionary Training