Methodology of Adapting Large English Language Models for Specific   Cultural Contexts

Wenjing Zhang; Siqi Xiao; Xuejiao Lei; Ning Wang; Huazheng; Zhang; Meijuan An; Bikun Yang; Zhaoxiang Liu; Kai Wang; Shiguo; Lian

arXiv:2406.18192·cs.CL·June 28, 2024·1 cites

Methodology of Adapting Large English Language Models for Specific Cultural Contexts

Wenjing Zhang, Siqi Xiao, Xuejiao Lei, Ning Wang, Huazheng, Zhang, Meijuan An, Bikun Yang, Zhaoxiang Liu, Kai Wang, Shiguo, Lian

PDF

Open Access

TL;DR

This paper presents a rapid adaptation method for large language models to better handle specific cultural contexts, demonstrated through Chinese cultural adaptation of LLaMA3-8B, improving domain knowledge and safety value alignment.

Contribution

It introduces a novel instruction-tuning approach leveraging cultural knowledge data to adapt large models for specific cultural contexts efficiently.

Findings

01

Enhanced domain-specific knowledge in the adapted model

02

Improved alignment with safety values

03

Maintained original model capabilities

Abstract

The rapid growth of large language models(LLMs) has emerged as a prominent trend in the field of artificial intelligence. However, current state-of-the-art LLMs are predominantly based on English. They encounter limitations when directly applied to tasks in specific cultural domains, due to deficiencies in domain-specific knowledge and misunderstandings caused by differences in cultural values. To address this challenge, our paper proposes a rapid adaptation method for large models in specific cultural contexts, which leverages instruction-tuning based on specific cultural knowledge and safety values data. Taking Chinese as the specific cultural context and utilizing the LLaMA3-8B as the experimental English LLM, the evaluation results demonstrate that the adapted LLM significantly enhances its capabilities in domain-specific knowledge and adaptability to safety values, while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSubtitles and Audiovisual Media