Measuring and Improving BERT's Mathematical Abilities by Predicting the Order of Reasoning
Piotr Pi\k{e}kos, Henryk Michalewski, Mateusz Malinowski

TL;DR
This paper explores BERT's mathematical reasoning abilities, fine-tunes it on math problem datasets, introduces new training tasks to enhance its performance, and achieves significant improvements over baseline models.
Contribution
The paper introduces Reasoning Order Prediction tasks to improve BERT's mathematical reasoning and demonstrates their effectiveness in enhancing model performance.
Findings
BERT can learn mathematical reasoning with proper fine-tuning.
Reasoning Order Prediction tasks improve BERT's accuracy on math problems.
Proposed methods reduce positional bias in language models.
Abstract
Imagine you are in a supermarket. You have two bananas in your basket and want to buy four apples. How many fruits do you have in total? This seemingly straightforward question can be challenging for data-driven language models, even if trained at scale. However, we would expect such generic language models to possess some mathematical abilities in addition to typical linguistic competence. Towards this goal, we investigate if a commonly used language model, BERT, possesses such mathematical abilities and, if so, to what degree. For that, we fine-tune BERT on a popular dataset for word math problems, AQuA-RAT, and conduct several tests to understand learned representations better. Since we teach models trained on natural language to do formal mathematics, we hypothesize that such models would benefit from training on semi-formal steps that explain how math results are derived. To better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Adam · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Linear Decay · Residual Connection · WordPiece · Attention Dropout · Dense Connections
