Can Large Language Models Replicate ITS Feedback on Open-Ended Math   Questions?

Hunter McNichols; Jaewook Lee; Stephen Fancsali; Steve Ritter; Andrew; Lan

arXiv:2405.06414·cs.CL·July 10, 2024·2 cites

Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions?

Hunter McNichols, Jaewook Lee, Stephen Fancsali, Steve Ritter, Andrew, Lan

PDF

Open Access 1 Repo

TL;DR

This study investigates whether large language models can generate effective feedback for open-ended math questions in ITS, finding they replicate training data well but struggle with unseen errors, indicating limited understanding of mathematical mistakes.

Contribution

The paper evaluates LLMs' ability to produce math feedback in ITS, highlighting their strengths in formatting but limitations in understanding diverse student errors.

Findings

01

LLMs replicate training feedback effectively

02

Models do not generalize well to unseen errors

03

LLMs show limited understanding of mathematical mistakes

Abstract

Intelligent Tutoring Systems (ITSs) often contain an automated feedback component, which provides a predefined feedback message to students when they detect a predefined error. To such a feedback component, we often resort to template-based approaches. These approaches require significant effort from human experts to detect a limited number of possible student errors and provide corresponding feedback. This limitation is exemplified in open-ended math questions, where there can be a large number of different incorrect errors. In our work, we examine the capabilities of large language models (LLMs) to generate feedback for open-ended math questions, similar to that of an established ITS that uses a template-based approach. We fine-tune both open-source and proprietary LLMs on real student responses and corresponding ITS-provided feedback. We measure the quality of the generated feedback…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

umass-ml4ed/its_feedback_edm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Intelligent Tutoring Systems and Adaptive Learning · Natural Language Processing Techniques