MathDivide: Improved mathematical reasoning by large language models
Saksham Sahai Srivastava, Ashutosh Gandhi

TL;DR
MathDivide is a novel prompting technique that enhances large language models' mathematical reasoning by decomposing problems into subproblems, using Python code evaluation, and refining answers to improve accuracy.
Contribution
The paper introduces MathDivide, a new prompting method that significantly improves LLM performance on math problems by problem decomposition and code-based evaluation.
Findings
MathDivide outperforms Math-prompter on GSM8K dataset.
It effectively decomposes complex problems into simpler subproblems.
The approach works on both open-source and closed-source LLMs.
Abstract
Large language models have been proven to be capable of handling complex linguistic and cognitive tasks. Therefore their usage has been extended to tasks requiring logical reasoning ability such as Mathematics. In this paper, we propose a prompting technique called MathDivide that breaks down the mathematical problem into simpler subproblems. Each of the subproblems is formulated as an algebraic expression whose value is evaluated by the Python code generated by the LLM for the corresponding algebraic expression. The values fed to the Python code are the numerical values provided in the problem statement. The solutions for the subproblems are composed together to obtain the final answer for the problem statement. Finally, the final answer is compared to the correct answer. If the final answer matches the correct answer, it is produced as output else a refinement prompt is fed to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Mathematics, Computing, and Information Processing · Natural Language Processing Techniques
