Towards Adaptive Feedback with AI: Comparing the Feedback Quality of   LLMs and Teachers on Experimentation Protocols

Kathrin Se{\ss}ler; Arne Bewersdorff; Claudia Nerdel; Enkelejda; Kasneci

arXiv:2502.12842·cs.AI·February 19, 2025·5 cites

Towards Adaptive Feedback with AI: Comparing the Feedback Quality of LLMs and Teachers on Experimentation Protocols

Kathrin Se{\ss}ler, Arne Bewersdorff, Claudia Nerdel, Enkelejda, Kasneci

PDF

Open Access

TL;DR

This study compares the quality of feedback from large language models, teachers, and experts on student experimentation protocols, finding LLMs perform comparably overall but need improvement in error explanation and contextual understanding.

Contribution

It provides a systematic evaluation of LLM-generated feedback against human experts, highlighting strengths and limitations in educational feedback quality.

Findings

01

LLMs match teachers and experts in overall feedback quality

02

LLMs underperform in error identification and explanation

03

Combining LLMs with human input can improve educational feedback

Abstract

Effective feedback is essential for fostering students' success in scientific inquiry. With advancements in artificial intelligence, large language models (LLMs) offer new possibilities for delivering instant and adaptive feedback. However, this feedback often lacks the pedagogical validation provided by real-world practitioners. To address this limitation, our study evaluates and compares the feedback quality of LLM agents with that of human teachers and science education experts on student-written experimentation protocols. Four blinded raters, all professionals in scientific inquiry and science education, evaluated the feedback texts generated by 1) the LLM agent, 2) the teachers and 3) the science education experts using a five-point Likert scale based on six criteria of effective feedback: Feed Up, Feed Back, Feed Forward, Constructive Tone, Linguistic Clarity, and Technical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics