Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models

Cheonbok Park; Jeonghoon Kim; Joosung Lee; Sanghwan Bae; Jaegul Choo; and Kang Min Yoo

arXiv:2506.05850·cs.CL·February 24, 2026

Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models

Cheonbok Park, Jeonghoon Kim, Joosung Lee, Sanghwan Bae, Jaegul Choo, and Kang Min Yoo

PDF

Open Access

TL;DR

This paper investigates how multilingual large language models tend to revert to their dominant language during reasoning tasks, revealing a trade-off between reasoning depth and language fidelity, and explores methods to mitigate this issue.

Contribution

It formalizes the phenomenon of Cross-lingual Collapse in LLMs, systematically studies its causes, and proposes interventions to reduce language drift during reasoning.

Findings

01

CoT reasoning drifts toward dominant language as performance improves

02

Language drift is amplified by priors, task difficulty, and decoding strategies

03

Interventions can mitigate collapse but introduce a performance-fidelity trade-off

Abstract

Reinforcement learning with verifiable reward (RLVR) has been instrumental in eliciting strong reasoning capabilities from large language models (LLMs) via long chains of thought (CoT). During RLVR training, we formalize and systemically study an empirical phenomenon whereby a multilingual model's CoT reverts to its dominant pre-training language (e.g., English) even when prompted in another language, which we term Cross-lingual Collapse. Because the long-CoT regime magnifies exposure to linguistic priors, the underlying trade-off between maximizing reasoning depth and preserving target-language fidelity has remained under-characterized. To examine this trade-off, we train LLMs with Group-Relative Policy Optimization (GRPO) on translated versions of math datasets widely used to elicit long-CoT reasoning. Throughout training, we track both task accuracy and the language consistency of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Explainable Artificial Intelligence (XAI)