Fill in the Blank: Exploring and Enhancing LLM Capabilities for Backward   Reasoning in Math Word Problems

Aniruddha Deb; Neeva Oza; Sarthak Singla; Dinesh Khandelwal; Dinesh; Garg; Parag Singla

arXiv:2310.01991·cs.CL·July 9, 2024

Fill in the Blank: Exploring and Enhancing LLM Capabilities for Backward Reasoning in Math Word Problems

Aniruddha Deb, Neeva Oza, Sarthak Singla, Dinesh Khandelwal, Dinesh, Garg, Parag Singla

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the backward reasoning capabilities of large language models on math word problems, identifies their limitations, and proposes strategies including reformulation, external solvers, verification, and ensemble methods to enhance performance.

Contribution

It introduces the first systematic study of backward reasoning in LLMs for math problems and develops novel strategies and ensemble techniques to improve their accuracy.

Findings

01

Significant performance drop in backward reasoning compared to forward reasoning.

02

Proposed strategies improve LLM accuracy on backward reasoning tasks.

03

Ensemble method achieves state-of-the-art results in backward reasoning.

Abstract

While forward reasoning (i.e., find the answer given the question) has been explored extensively in recent literature, backward reasoning is relatively unexplored. We examine the backward reasoning capabilities of LLMs on Math Word Problems (MWPs): given a mathematical question and its answer, with some details omitted from the question, can LLMs effectively retrieve the missing information? On modifying three benchmark datasets for this task, to evaluate this task: GSM8k, SVAMP, and MultiArith, we find a significant drop in the accuracy of models on this task compared to forward reasoning across SOTA LLMs (GPT4, GPT3.5, PaLM-2, and LLaMa). Motivated by the fact backward reasoning can be seen as the ''inverse'' of forward reasoning, we propose variations of three different forward reasoning strategies to improve performance. Rephrase reformulates the given problem into a forward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dair-iitd/fill-in-the-blank-mwp
noneOfficial

Videos

Fill in the Blank: Exploring and Enhancing LLM Capabilities for Backward Reasoning in Math Word Problems· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning

MethodsBalanced Selection