Probing the Emergence of Cross-lingual Alignment during LLM Training
Hetong Wang, Pasquale Minervini, Edoardo M. Ponti

TL;DR
This paper investigates how cross-lingual alignment naturally emerges during the training of multilingual LLMs, revealing a strong correlation between neuron overlap and transfer performance, and identifying phases of alignment degradation.
Contribution
It introduces intrinsic probing techniques to analyze neuron-level cross-lingual alignment during LLM pre-training, providing new insights into multilingual training dynamics.
Findings
Neuron overlap correlates with zero-shot transfer performance.
Alignment degrades at certain training phases.
Higher model scales show better cross-lingual alignment.
Abstract
Multilingual Large Language Models (LLMs) achieve remarkable levels of zero-shot cross-lingual transfer performance. We speculate that this is predicated on their ability to align languages without explicit supervision from parallel sentences. While representations of translationally equivalent sentences in different languages are known to be similar after convergence, however, it remains unclear how such cross-lingual alignment emerges during pre-training of LLMs. Our study leverages intrinsic probing techniques, which identify which subsets of neurons encode linguistic features, to correlate the degree of cross-lingual neuron overlap with the zero-shot cross-lingual transfer performance for a given model. In particular, we rely on checkpoints of BLOOM, a multilingual autoregressive LLM, across different training steps and model scales. We observe a high correlation between neuron…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Interpreting and Communication in Healthcare · Translation Studies and Practices
MethodsALIGN · BLOOM
