ICH-Qwen: A Large Language Model Towards Chinese Intangible Cultural Heritage
Wenhao Ye, Tiansheng Zheng, Yue Qi, Wenhua Zhao, Xiyu Wang, Xue Zhao, Jiacheng He, Yaya Zheng, Dongbo Wang

TL;DR
This paper introduces ICH-Qwen, a large language model trained on Chinese intangible cultural heritage data, aimed at aiding preservation, dissemination, and research of cultural assets amidst modernization challenges.
Contribution
The study develops a specialized large language model for Chinese ICH, integrating synthetic data and fine-tuning to enhance domain-specific understanding and applications.
Findings
ICH-Qwen effectively performs ICH-specific tasks
The model supports preservation and dissemination efforts
It offers new tools for digital humanities research
Abstract
The intangible cultural heritage (ICH) of China, a cultural asset transmitted across generations by various ethnic groups, serves as a significant testament to the evolution of human civilization and holds irreplaceable value for the preservation of historical lineage and the enhancement of cultural self-confidence. However, the rapid pace of modernization poses formidable challenges to ICH, including threats damage, disappearance and discontinuity of inheritance. China has the highest number of items on the UNESCO Intangible Cultural Heritage List, which is indicative of the nation's abundant cultural resources and emphasises the pressing need for ICH preservation. In recent years, the rapid advancements in large language modelling have provided a novel technological approach for the preservation and dissemination of ICH. This study utilises a substantial corpus of open-source Chinese…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
