Automated Feedback Generation for Undergraduate Mathematics: Development and Evaluation of an AI Teaching Assistant
Aron Gohr, Marie-Amelie Lawn, Kevin Gao, Inigo Serjeant, Stephen Heslip

TL;DR
This paper introduces an AI-powered system that provides automated, high-quality feedback on free-form mathematical reasoning for undergraduate students, matching human expert assessments and adaptable for educational platforms.
Contribution
We develop a modular, editable AI system using large language models capable of evaluating and commenting on free-form mathematical proofs, including style and presentation, with deployment in a real educational platform.
Findings
Feedback quality comparable to human experts
System handles a wide range of edge cases in mathematical reasoning
Successful deployment on an educational platform
Abstract
Intelligent tutoring systems have long enabled automated immediate feedback on student work when it is presented in a tightly structured format and when problems are very constrained, but reliably assessing free-form mathematical reasoning remains challenging. We present a system that processes free-form natural language input, handles a wide range of edge cases, and comments competently not only on the technical correctness of submitted proofs, but also on style and presentation issues. We discuss the advantages and disadvantages of various approaches to the evaluation of such a system, and show that by the metrics we evaluate, the quality of the feedback generated is comparable to that produced by human experts when assessing early undergraduate homework. We stress-test our system with a small set of more advanced and unusual questions, and report both significant gaps and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Mathematics, Computing, and Information Processing · Topic Modeling
