MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Longhui Yu, Weisen Jiang, Han Shi, Jincheng Yu, Zhengying Liu, Yu, Zhang, James T. Kwok, Zhenguo Li, Adrian Weller, Weiyang Liu

TL;DR
MetaMath introduces a fine-tuned LLaMA-2 model trained on a newly created dataset, MetaMathQA, which significantly improves mathematical reasoning performance on benchmarks like GSM8K and MATH.
Contribution
The paper presents MetaMath, a novel approach that bootstraps mathematical questions from multiple perspectives to enhance LLMs' reasoning abilities, outperforming existing open-source models.
Findings
MetaMath-7B achieves 66.4% on GSM8K and 19.4% on MATH.
MetaMath-70B achieves 82.3% accuracy on GSM8K, surpassing GPT-3.5-Turbo.
The MetaMathQA dataset and models are publicly released.
Abstract
Large language models (LLMs) have pushed the limits of natural language understanding and exhibited excellent problem-solving ability. Despite the great success, most existing open-source LLMs (e.g., LLaMA-2) are still far away from satisfactory for solving mathematical problem due to the complex reasoning procedures. To bridge this gap, we propose MetaMath, a fine-tuned language model that specializes in mathematical reasoning. Specifically, we start by bootstrapping mathematical questions by rewriting the question from multiple perspectives without extra knowledge, which results in a new dataset called MetaMathQA. Then we fine-tune the LLaMA-2 models on MetaMathQA. Experimental results on two popular benchmarks (i.e., GSM8K and MATH) for mathematical reasoning demonstrate that MetaMath outperforms a suite of open-source LLMs by a significant margin. Our MetaMath-7B model achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗choco9966/Llama-2-7b-instruct-tuningmodel· 6 dl6 dl
- 🤗meta-math/MetaMath-7B-V1.0model· 147 dl· ♡ 30147 dl♡ 30
- 🤗meta-math/MetaMath-13B-V1.0model· 1.0k dl· ♡ 131.0k dl♡ 13
- 🤗meta-math/MetaMath-70B-V1.0model· 901 dl· ♡ 17901 dl♡ 17
- 🤗akjindal53244/Arithmo-Mistral-7Bmodel· 781 dl· ♡ 62781 dl♡ 62
- 🤗meta-math/MetaMath-Mistral-7Bmodel· 1.2k dl· ♡ 961.2k dl♡ 96
- 🤗TheBloke/MetaMath-Mistral-7B-GGUFmodel· 373 dl· ♡ 10373 dl♡ 10
- 🤗TheBloke/MetaMath-Mistral-7B-GPTQmodel· 7 dl· ♡ 37 dl♡ 3
- 🤗TheBloke/MetaMath-Mistral-7B-AWQmodel· 10 dl· ♡ 110 dl♡ 1
- 🤗meta-math/MetaMath-Llemma-7Bmodel· 806 dl· ♡ 17806 dl♡ 17
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Cosine Annealing · Weight Decay · Multi-Head Attention · Adam · Residual Connection · Attention Dropout
