Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages
Zui Chen, Tianqiao Liu, Mi Tian, Qing Tong, Weiqi Luo, Zitao Liu

TL;DR
This paper investigates how problem-solving data, data synthesis methods, and training stages influence the mathematical reasoning abilities of large language models, leading to the development of the MathGPT-8B model.
Contribution
It demonstrates that problem-solving data and effective synthesis methods during pre-training significantly improve mathematical reasoning in LLMs, surpassing traditional general corpora and instruction fine-tuning.
Findings
Problem-solving data enhances mathematical reasoning more than general mathematical corpora.
Tutorship amplification synthesis method yields the best performance among data synthesis techniques.
Pre-training with problem-solving data outperforms fine-tuning in developing complex reasoning skills.
Abstract
Mathematical reasoning remains a challenging area for large language models (LLMs), prompting the development of math-specific LLMs such as LLEMMA, DeepSeekMath, and Qwen2-Math, among others. These models typically follow a two-stage training paradigm: pre-training with math-related corpora and post-training with problem datasets for supervised fine-tuning (SFT). Despite these efforts, the improvements in mathematical reasoning achieved through continued pre-training (CPT) are often less significant compared to those obtained via SFT. This study addresses this discrepancy by exploring alternative strategies during the pre-training phase, focusing on the use of problem-solving data over general mathematical corpora. We investigate three primary research questions: (1) Can problem-solving data enhance the model's mathematical reasoning capabilities more effectively than general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning
MethodsBalanced Selection · Shrink and Fine-Tune
