Are NLP Models really able to Solve Simple Math Word Problems?

Arkil Patel; Satwik Bhattamishra; Navin Goyal

arXiv:2103.07191·cs.CL·April 16, 2021

Are NLP Models really able to Solve Simple Math Word Problems?

Arkil Patel, Satwik Bhattamishra, Navin Goyal

PDF

3 Repos

TL;DR

This paper reveals that current NLP models for elementary math word problems often rely on superficial heuristics, and introduces a new challenging dataset, SVAMP, showing room for improvement in solving simple MWPs.

Contribution

The paper demonstrates the limitations of existing NLP solvers on simple MWPs and introduces SVAMP, a challenging dataset to evaluate true problem-solving capabilities.

Findings

01

Existing solvers can solve MWPs without understanding the question.

02

Bag-of-words models achieve high accuracy on benchmark datasets.

03

Performance drops significantly on the SVAMP dataset.

Abstract

The problem of designing NLP solvers for math word problems (MWP) has seen sustained research activity and steady gains in the test accuracy. Since existing solvers achieve high performance on the benchmark datasets for elementary level MWPs containing one-unknown arithmetic word problems, such problems are often considered "solved" with the bulk of research attention moving to more complex MWPs. In this paper, we restrict our attention to English MWPs taught in grades four and lower. We provide strong evidence that the existing MWP solvers rely on shallow heuristics to achieve high performance on the benchmark datasets. To this end, we show that MWP solvers that do not have access to the question asked in the MWP can still solve a large fraction of MWPs. Similarly, models that treat MWPs as bag-of-words can also achieve surprisingly high accuracy. Further, we introduce a challenge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGoal-Driven Tree-Structured Neural Model · Graph-to-Tree MWP Solver