InstructionCP: A fast approach to transfer Large Language Models into   target language

Kuang-Ming Chen; Hung-yi Lee

arXiv:2405.20175·cs.CL·May 31, 2024

InstructionCP: A fast approach to transfer Large Language Models into target language

Kuang-Ming Chen, Hung-yi Lee

PDF

Open Access

TL;DR

InstructionCP introduces an efficient method to adapt large language models to new languages by integrating instruction tags during continual pre-training, preserving conversational abilities and reducing resource needs.

Contribution

It proposes Instruction Continual Pre-training (InsCP), a novel technique that maintains conversational skills while adapting models to new languages using minimal data.

Findings

01

InsCP retains conversational and RLHF abilities.

02

It achieves effective language adaptation with only 0.1 billion tokens.

03

Experimental results confirm improved language alignment and reliability.

Abstract

The rapid development of large language models (LLMs) in recent years has largely focused on English, resulting in models that respond exclusively in English. To adapt these models to other languages, continual pre-training (CP) is often employed, followed by supervised fine-tuning (SFT) to maintain conversational abilities. However, CP and SFT can reduce a model's ability to filter harmful content. We propose Instruction Continual Pre-training (InsCP), which integrates instruction tags into the CP process to prevent loss of conversational proficiency while acquiring new languages. Our experiments demonstrate that InsCP retains conversational and Reinforcement Learning from Human Feedback (RLHF) abilities. Empirical evaluations on language alignment, reliability, and knowledge benchmarks confirm the efficacy of InsCP. Notably, this approach requires only 0.1 billion tokens of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsShrink and Fine-Tune