MuMath-Code: Combining Tool-Use Large Language Models with   Multi-perspective Data Augmentation for Mathematical Reasoning

Shuo Yin; Weihao You; Zhilong Ji; Guoqiang Zhong; Jinfeng Bai

arXiv:2405.07551·cs.CL·May 14, 2024·1 cites

MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning

Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai

PDF

Open Access 1 Repo 1 Video

TL;DR

MuMath-Code combines data augmentation and external tool use in large language models to significantly improve mathematical reasoning, achieving state-of-the-art results on benchmark datasets.

Contribution

This work introduces a novel integration of multi-perspective data augmentation with tool-use LLMs, enhancing mathematical reasoning capabilities.

Findings

01

MuMath-Code achieves 90.7% on GSM8K.

02

It attains 55.1% on MATH.

03

The two-stage training strategy improves performance.

Abstract

The tool-use Large Language Models (LLMs) that integrate with external Python interpreters have significantly enhanced mathematical reasoning capabilities for open-source LLMs, while tool-free methods chose another track: augmenting math reasoning data. However, a great method to integrate the above two research paths and combine their advantages remains to be explored. In this work, we firstly include new math questions via multi-perspective data augmenting methods and then synthesize code-nested solutions to them. The open LLMs (i.e., Llama-2) are finetuned on the augmented dataset to get the resulting models, MuMath-Code ( $μ$ -Math-Code). During the inference phase, our MuMath-Code generates code and interacts with the external python interpreter to get the execution results. Therefore, MuMath-Code leverages the advantages of both the external tool and data augmentation. To fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

project-numina/aimo-progress-prize
pytorch

Videos

MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning· underline

Taxonomy

TopicsMathematics, Computing, and Information Processing · Model-Driven Software Engineering Techniques