Pretrained Language Models are Symbolic Mathematics Solvers too!
Kimia Noorbakhsh, Modar Sulaiman, Mahdi Sharifi, Kallol Roy, Pooyan, Jamshidi

TL;DR
This paper demonstrates that pretrained language models can efficiently solve symbolic mathematics problems, achieving high accuracy with significantly less training data by leveraging language translation pretraining.
Contribution
The authors introduce a sample-efficient pretraining approach for transformer models using language translation, improving symbolic mathematics solving with less data and analyzing language bias and robustness.
Findings
Achieves comparable integration accuracy with 1.5 orders of magnitude less data.
Lower accuracy on differential equations due to higher-order recursions.
Pretraining with language translation enhances generalization and robustness.
Abstract
Solving symbolic mathematics has always been of in the arena of human ingenuity that needs compositional reasoning and recurrence. However, recent studies have shown that large-scale language models such as transformers are universal and surprisingly can be trained as a sequence-to-sequence task to solve complex mathematical equations. These large transformer models need humongous amounts of training data to generalize to unseen symbolic mathematics problems. In this paper, we present a sample efficient way of solving the symbolic tasks by first pretraining the transformer model with language translation and then fine-tuning the pretrained transformer model to solve the downstream task of symbolic mathematics. We achieve comparable accuracy on the integration task with our pretrained model while using around orders of magnitude less number of training samples with respect to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Model Reduction and Neural Networks
MethodsTest
