System-2 Mathematical Reasoning via Enriched Instruction Tuning

Huanqia Cai; Yijun Yang; Zhifeng Li

arXiv:2412.16964·cs.AI·December 25, 2024

System-2 Mathematical Reasoning via Enriched Instruction Tuning

Huanqia Cai, Yijun Yang, Zhifeng Li

PDF

Open Access

TL;DR

This paper introduces Enriched Instruction Tuning (EIT), a novel method that enhances large language models' mathematical reasoning by creating detailed reasoning trajectories through human and AI feedback, significantly improving their problem-solving accuracy.

Contribution

The paper presents EIT, a new fine-tuning approach that enriches reasoning data with detailed plans and steps, outperforming existing methods in mathematical problem-solving accuracy.

Findings

01

EIT achieves 84.1% accuracy on GSM8K.

02

EIT reaches 32.5% accuracy on MATH.

03

EIT surpasses state-of-the-art fine-tuning and prompting methods.

Abstract

Solving complex mathematical problems via system-2 reasoning is a natural human skill, yet it remains a significant challenge for current large language models (LLMs). We identify the scarcity of deliberate multi-step reasoning data as a primary limiting factor. To this end, we introduce Enriched Instruction Tuning (EIT), a method that enriches existing human-annotated mathematical datasets by synergizing human and AI feedback to create fine-grained reasoning trajectories. These datasets are then used to fine-tune open-source LLMs, enhancing their mathematical reasoning abilities without reliance on any symbolic verification program. Concretely, EIT is composed of two critical steps: Enriching with Reasoning Plan (ERP) and Enriching with Reasoning Step (ERS). The former generates a high-level plan that breaks down complex instructions into a sequence of simpler objectives, while ERS…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications · Neural Networks and Applications · Intelligent Tutoring Systems and Adaptive Learning

MethodsChain-of-thought prompting