Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training
Junxiao Liu, Zhijun Wang, Yixiao Li, Zhejian Lai, Liqian Huang, Xin Huang, Xue Han, Junlan Feng, Shujian Huang

TL;DR
This paper introduces TRIT, a self-improving training framework that enhances multilingual reasoning by integrating translation training, leading to significant improvements in answer accuracy and language consistency without external data.
Contribution
TRIT is a novel framework that jointly trains translation and reasoning models, improving multilingual question understanding and reasoning accuracy without external multilingual datasets.
Findings
Outperforms baselines by 7 percentage points on MMATH.
Improves cross-lingual question alignment by over 10%.
Enhances translation quality with gains up to 8.4 COMET points.
Abstract
Long reasoning models often struggle in multilingual settings: they tend to reason in English for non-English questions; when constrained to reasoning in the question language, accuracies drop substantially. The struggle is caused by the limited abilities for both multilingual question understanding and multilingual reasoning. To address both problems, we propose TRIT (Translation-Reasoning Integrated Training), a self-improving framework that integrates the training of translation into multilingual reasoning. Without external feedback or additional multilingual data, our method jointly enhances multilingual question understanding and response generation. On MMATH, our method outperforms multiple baselines by an average of 7 percentage points, improving both answer correctness and language consistency. Further analysis reveals that integrating translation training improves cross-lingual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
