MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning
Debrup Das, Debopriyo Banerjee, Somak Aditya, Ashish Kulkarni

TL;DR
MathSensei is a tool-augmented large language model that significantly improves mathematical reasoning accuracy by integrating knowledge retrieval, program execution, and symbolic solving, especially on complex problems.
Contribution
This work introduces MathSensei, a novel tool-augmented LLM that combines multiple tools to enhance mathematical reasoning, demonstrating substantial performance gains on complex datasets.
Findings
MathSensei outperforms GPT-3.5-turbo by 13.5% on MATH dataset.
Tool integration benefits increase with problem complexity.
TALMs are less effective on simple math problems like GSM-8K.
Abstract
Tool-augmented Large Language Models (TALMs) are known to enhance the skillset of large language models (LLMs), thereby, leading to their improved reasoning abilities across many tasks. While, TALMs have been successfully employed in different question-answering benchmarks, their efficacy on complex mathematical reasoning benchmarks, and the potential complementary benefits offered by tools for knowledge retrieval and mathematical equation solving are open research questions. In this work, we present MathSensei, a tool-augmented large language model for mathematical reasoning. We study the complementary benefits of the tools - knowledge retriever (Bing Web Search), program generator + executor (Python), and symbolic equation solver (Wolfram-Alpha API) through evaluations on mathematical reasoning datasets. We perform exhaustive ablations on MATH, a popular dataset for evaluating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMathematics, Computing, and Information Processing · Natural Language Processing Techniques · Intelligent Tutoring Systems and Adaptive Learning
