A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level
Iddo Drori, Sarah Zhang, Reece Shuttleworth, Leonard Tang, Albert Lu,, Elizabeth Ke, Kevin Liu, Linda Chen, Sunny Tran, Newman Cheng, Roman Wang,, Nikhil Singh, Taylor L. Patti, Jayson Lynch, Avi Shporer, Nakul Verma, Eugene, Wu, Gilbert Strang

TL;DR
This paper demonstrates that a fine-tuned neural network, using program synthesis and few-shot learning, can solve, explain, and generate university-level mathematics problems at human performance levels, surpassing previous models significantly.
Contribution
It introduces a novel approach combining program synthesis and few-shot learning with Codex to solve and generate university math problems at scale, achieving unprecedented accuracy.
Findings
Achieves 81% automatic accuracy on university math questions.
Outperforms GPT-3 zero-shot and few-shot methods significantly.
First to automatically solve and generate university-level math problems at scale.
Abstract
We demonstrate that a neural network pre-trained on text and fine-tuned on code solves mathematics course problems, explains solutions, and generates new questions at a human level. We automatically synthesize programs using few-shot learning and OpenAI's Codex transformer and execute them to solve course problems at 81% automatic accuracy. We curate a new dataset of questions from MIT's largest mathematics courses (Single Variable and Multivariable Calculus, Differential Equations, Introduction to Probability and Statistics, Linear Algebra, and Mathematics for Computer Science) and Columbia University's Computational Linear Algebra. We solve questions from a MATH dataset (on Prealgebra, Algebra, Counting and Probability, Intermediate Algebra, Number Theory, and Precalculus), the latest benchmark of advanced mathematics problems designed to assess mathematical reasoning. We randomly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Is OpenAI’s AI As Smart As A University Student? 🤖· youtube
A Neural Network Solves and Generates Mathematics Problems by Program Synthesis | Paper Explained· youtube
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Weight Decay · Cosine Annealing · Layer Normalization · Softmax · Adam · Residual Connection
