Exploring Mathematical Extrapolation of Large Language Models with   Synthetic Data

Haolong Li; Yu Ma; Yinqi Zhang; Chen Ye; Jie Chen

arXiv:2406.02100·cs.CL·June 5, 2024

Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data

Haolong Li, Yu Ma, Yinqi Zhang, Chen Ye, Jie Chen

PDF

Open Access 1 Video

TL;DR

This paper demonstrates that fine-tuning large language models with synthetic data can significantly improve their multi-step mathematical reasoning abilities, including generalization to out-of-domain problems.

Contribution

It introduces a novel arithmetical puzzle problem and shows that synthetic data fine-tuning enhances LLMs' reasoning and generalization capabilities.

Findings

01

Fine-tuned open-llama-3B achieves 0.44 pass@1 on in-domain data.

02

Models reach 0.33 and 0.35 pass@1 on extended out-of-domain datasets.

03

Synthetic data fine-tuning improves multi-step reasoning in LLMs.

Abstract

Large Language Models (LLMs) have shown excellent performance in language understanding, text generation, code synthesis, and many other tasks, while they still struggle in complex multi-step reasoning problems, such as mathematical reasoning. In this paper, through a newly proposed arithmetical puzzle problem, we show that the model can perform well on multi-step reasoning tasks via fine-tuning on high-quality synthetic data. Experimental results with the open-llama-3B model on three different test datasets show that not only the model can reach a zero-shot pass@1 at 0.44 on the in-domain dataset, it also demonstrates certain generalization capabilities on the out-of-domain datasets. Specifically, this paper has designed two out-of-domain datasets in the form of extending the numerical range and the composing components of the arithmetical puzzle problem separately. The fine-tuned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques