Semantic Pivots Enable Cross-Lingual Transfer in Large Language Models

Kaiyu He; Tong Zhou; Yubo Chen; Delai Qiu; Shengping Liu; Kang Liu; Jun Zhao

arXiv:2505.16385·cs.CL·May 23, 2025

Semantic Pivots Enable Cross-Lingual Transfer in Large Language Models

Kaiyu He, Tong Zhou, Yubo Chen, Delai Qiu, Shengping Liu, Kang Liu, Jun Zhao

PDF

Open Access

TL;DR

This paper investigates how large language models acquire cross-lingual abilities by analyzing intermediate layer behaviors, identifying semantic pivots, and reconstructing datasets to enhance multilingual transfer.

Contribution

It introduces a novel Word-Level Cross-Lingual Translation Task and a semantic pivot-aware dataset to improve LLMs' cross-lingual transfer capabilities.

Findings

01

Identified co-occurrence and semantic pivot behaviors in LLMs

02

Semantic pivot-based dataset improves cross-lingual performance

03

Enhanced interpretability of LLMs in multilingual tasks

Abstract

Large language models (LLMs) demonstrate remarkable ability in cross-lingual tasks. Understanding how LLMs acquire this ability is crucial for their interpretability. To quantify the cross-lingual ability of LLMs accurately, we propose a Word-Level Cross-Lingual Translation Task. To find how LLMs learn cross-lingual ability, we trace the outputs of LLMs' intermediate layers in the word translation task. We identify and distinguish two distinct behaviors in the forward pass of LLMs: co-occurrence behavior and semantic pivot behavior. We attribute LLMs' two distinct behaviors to the co-occurrence frequency of words and find the semantic pivot from the pre-training dataset. Finally, to apply our findings to improve the cross-lingual ability of LLMs, we reconstruct a semantic pivot-aware pre-training dataset using documents with a high proportion of semantic pivots. Our experiments validate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI)