Fill in the Blank: Exploring and Enhancing LLM Capabilities for Backward Reasoning in Math Word Problems
Aniruddha Deb, Neeva Oza, Sarthak Singla, Dinesh Khandelwal, Dinesh, Garg, Parag Singla

TL;DR
This paper investigates the backward reasoning capabilities of large language models on math word problems, identifies their limitations, and proposes strategies including reformulation, external solvers, verification, and ensemble methods to enhance performance.
Contribution
It introduces the first systematic study of backward reasoning in LLMs for math problems and develops novel strategies and ensemble techniques to improve their accuracy.
Findings
Significant performance drop in backward reasoning compared to forward reasoning.
Proposed strategies improve LLM accuracy on backward reasoning tasks.
Ensemble method achieves state-of-the-art results in backward reasoning.
Abstract
While forward reasoning (i.e., find the answer given the question) has been explored extensively in recent literature, backward reasoning is relatively unexplored. We examine the backward reasoning capabilities of LLMs on Math Word Problems (MWPs): given a mathematical question and its answer, with some details omitted from the question, can LLMs effectively retrieve the missing information? On modifying three benchmark datasets for this task, to evaluate this task: GSM8k, SVAMP, and MultiArith, we find a significant drop in the accuracy of models on this task compared to forward reasoning across SOTA LLMs (GPT4, GPT3.5, PaLM-2, and LLaMa). Motivated by the fact backward reasoning can be seen as the ''inverse'' of forward reasoning, we propose variations of three different forward reasoning strategies to improve performance. Rephrase reformulates the given problem into a forward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning
MethodsBalanced Selection
