Probing the Emergence of Cross-lingual Alignment during LLM Training

Hetong Wang; Pasquale Minervini; Edoardo M. Ponti

arXiv:2406.13229·cs.CL·June 21, 2024·1 cites

Probing the Emergence of Cross-lingual Alignment during LLM Training

Hetong Wang, Pasquale Minervini, Edoardo M. Ponti

PDF

Open Access 1 Video

TL;DR

This paper investigates how cross-lingual alignment naturally emerges during the training of multilingual LLMs, revealing a strong correlation between neuron overlap and transfer performance, and identifying phases of alignment degradation.

Contribution

It introduces intrinsic probing techniques to analyze neuron-level cross-lingual alignment during LLM pre-training, providing new insights into multilingual training dynamics.

Findings

01

Neuron overlap correlates with zero-shot transfer performance.

02

Alignment degrades at certain training phases.

03

Higher model scales show better cross-lingual alignment.

Abstract

Multilingual Large Language Models (LLMs) achieve remarkable levels of zero-shot cross-lingual transfer performance. We speculate that this is predicated on their ability to align languages without explicit supervision from parallel sentences. While representations of translationally equivalent sentences in different languages are known to be similar after convergence, however, it remains unclear how such cross-lingual alignment emerges during pre-training of LLMs. Our study leverages intrinsic probing techniques, which identify which subsets of neurons encode linguistic features, to correlate the degree of cross-lingual neuron overlap with the zero-shot cross-lingual transfer performance for a given model. In particular, we rely on checkpoints of BLOOM, a multilingual autoregressive LLM, across different training steps and model scales. We observe a high correlation between neuron…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Probing the Emergence of Cross-lingual Alignment during LLM Training· underline

Taxonomy

TopicsNatural Language Processing Techniques · Interpreting and Communication in Healthcare · Translation Studies and Practices

MethodsALIGN · BLOOM