BayLing 2: A Multilingual Large Language Model with Efficient Language   Alignment

Shaolei Zhang; Kehao Zhang; Qingkai Fang; Shoutao Guo; Yan Zhou,; Xiaodong Liu; Yang Feng

arXiv:2411.16300·cs.CL·December 20, 2024

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

Shaolei Zhang, Kehao Zhang, Qingkai Fang, Shoutao Guo, Yan Zhou,, Xiaodong Liu, Yang Feng

PDF

Open Access 1 Repo

TL;DR

BayLing 2 is a multilingual large language model that efficiently transfers knowledge from high-resource to low-resource languages using language alignment and instruction tuning, significantly improving performance across over 100 languages.

Contribution

Introduces BayLing 2, a model that leverages language alignment and a large instruction dataset to enhance multilingual capabilities without extensive instruction data for each language.

Findings

01

Outperforms similar-scale open-source models in multilingual translation.

02

Achieves significant improvements in low-resource language understanding.

03

Maintains high performance in high-resource languages while improving low-resource language capabilities.

Abstract

Large language models (LLMs), with their powerful generative capabilities and vast knowledge, empower various tasks in everyday life. However, these abilities are primarily concentrated in high-resource languages, leaving low-resource languages with weaker generative capabilities and relatively limited knowledge. Enhancing the multilingual capabilities of LLMs is therefore crucial for serving over 100 linguistic communities worldwide. An intuitive approach to enhance the multilingual capabilities would be to construct instruction data for various languages, but constructing instruction data for over 100 languages is prohibitively costly. In this paper, we introduce BayLing 2, which efficiently transfers generative capabilities and knowledge from high-resource languages to low-resource languages through language alignment. To achieve this, we constructed a dataset of 3.2 million…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ictnlp/bayling
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsLLaMA