Align to the Pivot: Dual Alignment with Self-Feedback for Multilingual Math Reasoning
Chunxu Zhao, Xin Huang, Xue Han, Shujian Huang, Chao Deng, Junlan Feng

TL;DR
This paper introduces PASMR, a novel training method that improves multilingual math reasoning in large language models by aligning reasoning patterns through a pivot language and self-feedback, especially benefiting low-resource languages.
Contribution
The paper proposes a new training approach that enhances multilingual reasoning in LLMs by using a pivot language and self-feedback, without external supervision.
Findings
Improved reasoning accuracy across multiple languages.
Enhanced understanding of questions in low-resource languages.
Significant performance gains demonstrated in experiments.
Abstract
Despite the impressive reasoning abilities demonstrated by large language models (LLMs), empirical evidence indicates that they are not language agnostic as expected, leading to performance declines in multilingual settings, especially for low-resource languages. We attribute the decline to the model's inconsistent multilingual understanding and reasoning alignment. To address this, we present Pivot-Aligned Self-Feedback Multilingual Reasoning (PASMR), aiming to improve the alignment of multilingual math reasoning abilities in LLMs. This approach designates the model's primary language as the pivot language. During training, the model first translates questions into the pivot language to facilitate better alignment of reasoning patterns. The reasoning process in the target language is then supervised by the pivot language's reasoning answers, thereby establishing a cross-lingual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Intelligent Tutoring Systems and Adaptive Learning · Multimodal Machine Learning Applications
