A Survey on Large Language Models for Mathematical Reasoning

Peng-Yuan Wang; Tian-Shuo Liu; Chenyang Wang; Yi-Di Wang; Shu Yan; Cheng-Xing Jia; Xu-Hui Liu; Xin-Wei Chen; Jia-Cheng Xu; Ziniu Li; Yang Yu

arXiv:2506.08446·cs.AI·June 11, 2025

A Survey on Large Language Models for Mathematical Reasoning

Peng-Yuan Wang, Tian-Shuo Liu, Chenyang Wang, Yi-Di Wang, Shu Yan, Cheng-Xing Jia, Xu-Hui Liu, Xin-Wei Chen, Jia-Cheng Xu, Ziniu Li, Yang Yu

PDF

Open Access

TL;DR

This survey reviews recent advances in large language models' mathematical reasoning abilities, focusing on comprehension and answer generation, and discusses methods, challenges, and future research directions.

Contribution

It provides a comprehensive overview of methods and challenges in enhancing LLMs' mathematical reasoning, highlighting recent innovations like extended CoT and test-time scaling.

Findings

01

Progress in step-by-step reasoning techniques

02

Challenges in model capacity and generalization

03

Promising directions include knowledge augmentation and formal reasoning

Abstract

Mathematical reasoning has long represented one of the most fundamental and challenging frontiers in artificial intelligence research. In recent years, large language models (LLMs) have achieved significant advances in this area. This survey examines the development of mathematical reasoning abilities in LLMs through two high-level cognitive phases: comprehension, where models gain mathematical understanding via diverse pretraining strategies, and answer generation, which has progressed from direct prediction to step-by-step Chain-of-Thought (CoT) reasoning. We review methods for enhancing mathematical reasoning, ranging from training-free prompting to fine-tuning approaches such as supervised fine-tuning and reinforcement learning, and discuss recent work on extended CoT and "test-time scaling". Despite notable progress, fundamental challenges remain in terms of capacity, efficiency,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Text Readability and Simplification