EfficientXLang: Towards Improving Token Efficiency Through Cross-Lingual Reasoning

Sanchit Ahuja; Praneetha Vaddamanu; Barun Patra

arXiv:2507.00246·cs.CL·July 2, 2025

EfficientXLang: Towards Improving Token Efficiency Through Cross-Lingual Reasoning

Sanchit Ahuja, Praneetha Vaddamanu, Barun Patra

PDF

Open Access 1 Video

TL;DR

This paper demonstrates that reasoning in non-English languages can be more token-efficient and equally accurate compared to English in multilingual language reasoning models, emphasizing the value of multilingual reasoning.

Contribution

It provides empirical evidence that multilingual reasoning reduces token usage without sacrificing accuracy, highlighting the importance of multilingual foundations in language models.

Findings

01

Non-English reasoning reduces token usage.

02

Accuracy is preserved across languages.

03

Gains depend on multilingual model strength.

Abstract

Despite recent advances in Language Reasoning Models (LRMs), most research focuses solely on English, even though many models are pretrained on multilingual data. In this work, we investigate: Is English the most token-efficient language for reasoning? We evaluate three open-source RLMs: DeepSeek R1, Qwen 2.5 and Qwen 3, across four math datasets and seven typologically diverse languages. We find that reasoning in non-English languages not only reduces token usage, but also preserves accuracy. These gains persist even after translating the reasoning traces into English, suggesting genuine shifts in reasoning behavior rather than surface-level linguistic effects. The extent of improvement, however, depends on the models multilingual strength. Our findings motivate a broader view of reasoning in language models, highlighting the potential of multilingual reasoning and the importance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

EfficientXLang: Towards Improving Token Efficiency Through Cross-Lingual Reasoning· underline

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques