mCoT: Multilingual Instruction Tuning for Reasoning Consistency in   Language Models

Huiyuan Lai; Malvina Nissim

arXiv:2406.02301·cs.CL·July 11, 2024

mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models

Huiyuan Lai, Malvina Nissim

PDF

Open Access 1 Repo 1 Models 2 Datasets

TL;DR

This paper introduces mCoT, a multilingual instruction tuning method that enhances reasoning consistency across languages in large language models, demonstrated on a new multilingual math reasoning dataset.

Contribution

The paper presents the first large-scale multilingual math reasoning dataset and a multilingual CoT instruction tuning approach that improves reasoning consistency across diverse languages.

Findings

01

mCoT achieves high reasoning consistency across 11 languages.

02

The approach outperforms larger models in multilingual reasoning tasks.

03

Lesser-resourced languages benefit significantly from the tuning.

Abstract

Large language models (LLMs) with Chain-of-thought (CoT) have recently emerged as a powerful technique for eliciting reasoning to improve various downstream tasks. As most research mainly focuses on English, with few explorations in a multilingual context, the question of how reliable this reasoning capability is in different languages is still open. To address it directly, we study multilingual reasoning consistency across multiple languages, using popular open-source LLMs. First, we compile the first large-scale multilingual math reasoning dataset, mCoT-MATH, covering eleven diverse languages. Then, we introduce multilingual CoT instruction tuning to boost reasoning capability across languages, thereby improving model consistency. While existing LLMs show substantial variation across the languages we consider, and especially low performance for lesser resourced languages, our 7B…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

laihuiyuan/mcot
pytorchOfficial

Models

🤗
laihuiyuan/mCoT
model· 8 dl· ♡ 2
8 dl♡ 2

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies