TL;DR
This study investigates multilingual latent reasoning in large reasoning models across 11 languages, revealing resource-dependent strengths and internal prediction mechanisms consistent with English, despite surface-level differences.
Contribution
It provides the first systematic analysis of multilingual latent reasoning in LRMs, highlighting resource effects and internal mechanism similarities across languages.
Findings
Latent reasoning exists in multiple languages, stronger in resource-rich ones.
Internal prediction processes are consistent across languages and align with English.
Harder benchmarks show less observable latent reasoning.
Abstract
Large reasoning models (LRMs) achieve strong performance on mathematical reasoning tasks, often attributed to their capability to generate explicit chain-of-thought (CoT) explanations. However, recent work shows that LRMs often arrive at the correct answer before completing these textual reasoning steps, indicating the presence of latent reasoning -- internal, non-verbal computation encoded in hidden states. While this phenomenon has been explored in English, its multilingual behavior remains largely unknown. In this paper, we conduct a systematic investigation of multilingual latent reasoning in LRMs across 11 languages. Using a truncation-based strategy, we examine how the correct answer emerges as the model is given only partial reasoning traces, allowing us to measure stepwise latent prediction formation. Our results reveal clear evidence of multilingual latent reasoning, though…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
