Getting More from Less: Large Language Models are Good Spontaneous   Multilingual Learners

Shimao Zhang; Changjiang Gao; Wenhao Zhu; Jiajun Chen; Xin Huang; Xue; Han; Junlan Feng; Chao Deng; Shujian Huang

arXiv:2405.13816·cs.CL·June 19, 2024

Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners

Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue, Han, Junlan Feng, Chao Deng, Shujian Huang

PDF

Open Access 2 Repos 8 Models 1 Video

TL;DR

This paper investigates how instruction-tuned large language models can spontaneously improve their multilingual alignment without explicit parallel data, demonstrating their potential for better multilingual understanding and generalization.

Contribution

The study reveals that instruction-tuned LLMs can enhance multilingual alignment spontaneously, even for unseen languages, through question translation data without explicit annotation.

Findings

01

LLMs improve multilingual alignment without explicit parallel data.

02

Instruction tuning on question translation encourages cross-language alignment.

03

LLMs show strong potential for multilingual generalization.

Abstract

Recently, Large Language Models (LLMs) have shown impressive language capabilities. While most of the existing LLMs have very unbalanced performance across different languages, multilingual alignment based on translation parallel data is an effective method to enhance the LLMs' multilingual capabilities. In this work, we discover and comprehensively investigate the spontaneous multilingual alignment improvement of LLMs. We find that LLMs instruction-tuned on the question translation data (i.e. without annotated answers) are able to encourage the alignment between English and a wide range of languages, even including those unseen during instruction-tuning. Additionally, we utilize different settings and mechanistic interpretability methods to analyze the LLM's performance in the multilingual scenario comprehensively. Our work suggests that LLMs have enormous potential for improving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling