Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers
Vivek Kumar, Rishabh Maheshwary, Vikram Pudi

TL;DR
This paper introduces data augmentation techniques to enhance the generalization and robustness of math word problem solvers, significantly improving their performance and ability to handle diverse problems.
Contribution
The paper proposes novel data augmentation methods that increase dataset size fivefold, leading to improved accuracy and robustness of MWP solvers across benchmarks.
Findings
Augmented datasets improve solver accuracy by over five percentage points.
Solvers trained on augmented data perform better on challenge test sets.
Proposed techniques are validated through ablation studies and human evaluation.
Abstract
Existing Math Word Problem (MWP) solvers have achieved high accuracy on benchmark datasets. However, prior works have shown that such solvers do not generalize well and rely on superficial cues to achieve high performance. In this paper, we first conduct experiments to showcase that this behaviour is mainly associated with the limited size and diversity present in existing MWP datasets. Next, we propose several data augmentation techniques broadly categorized into Substitution and Paraphrasing based methods. By deploying these methods we increase the size of existing datasets by five folds. Extensive experiments on two benchmark datasets across three state-of-the-art MWP solvers show that proposed methods increase the generalization and robustness of existing solvers. On average, proposed methods significantly increase the state-of-the-art results by over five percentage points on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Handwritten Text Recognition Techniques
