mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Huiyuan Lai, Malvina Nissim

TL;DR
This paper introduces mCoT, a multilingual instruction tuning method that enhances reasoning consistency across languages in large language models, demonstrated on a new multilingual math reasoning dataset.
Contribution
The paper presents the first large-scale multilingual math reasoning dataset and a multilingual CoT instruction tuning approach that improves reasoning consistency across diverse languages.
Findings
mCoT achieves high reasoning consistency across 11 languages.
The approach outperforms larger models in multilingual reasoning tasks.
Lesser-resourced languages benefit significantly from the tuning.
Abstract
Large language models (LLMs) with Chain-of-thought (CoT) have recently emerged as a powerful technique for eliciting reasoning to improve various downstream tasks. As most research mainly focuses on English, with few explorations in a multilingual context, the question of how reliable this reasoning capability is in different languages is still open. To address it directly, we study multilingual reasoning consistency across multiple languages, using popular open-source LLMs. First, we compile the first large-scale multilingual math reasoning dataset, mCoT-MATH, covering eleven diverse languages. Then, we introduce multilingual CoT instruction tuning to boost reasoning capability across languages, thereby improving model consistency. While existing LLMs show substantial variation across the languages we consider, and especially low performance for lesser resourced languages, our 7B…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
