Advancing Mathematical Reasoning in Language Models: The Impact of   Problem-Solving Data, Data Synthesis Methods, and Training Stages

Zui Chen; Tianqiao Liu; Mi Tian; Qing Tong; Weiqi Luo; Zitao Liu

arXiv:2501.14002·cs.CL·March 25, 2025

Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages

Zui Chen, Tianqiao Liu, Mi Tian, Qing Tong, Weiqi Luo, Zitao Liu

PDF

Open Access

TL;DR

This paper investigates how problem-solving data, data synthesis methods, and training stages influence the mathematical reasoning abilities of large language models, leading to the development of the MathGPT-8B model.

Contribution

It demonstrates that problem-solving data and effective synthesis methods during pre-training significantly improve mathematical reasoning in LLMs, surpassing traditional general corpora and instruction fine-tuning.

Findings

01

Problem-solving data enhances mathematical reasoning more than general mathematical corpora.

02

Tutorship amplification synthesis method yields the best performance among data synthesis techniques.

03

Pre-training with problem-solving data outperforms fine-tuning in developing complex reasoning skills.

Abstract

Mathematical reasoning remains a challenging area for large language models (LLMs), prompting the development of math-specific LLMs such as LLEMMA, DeepSeekMath, and Qwen2-Math, among others. These models typically follow a two-stage training paradigm: pre-training with math-related corpora and post-training with problem datasets for supervised fine-tuning (SFT). Despite these efforts, the improvements in mathematical reasoning achieved through continued pre-training (CPT) are often less significant compared to those obtained via SFT. This study addresses this discrepancy by exploring alternative strategies during the pre-training phase, focusing on the use of problem-solving data over general mathematical corpora. We investigate three primary research questions: (1) Can problem-solving data enhance the model's mathematical reasoning capabilities more effectively than general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning

MethodsBalanced Selection · Shrink and Fine-Tune