Towards Adaptive Feedback with AI: Comparing the Feedback Quality of LLMs and Teachers on Experimentation Protocols
Kathrin Se{\ss}ler, Arne Bewersdorff, Claudia Nerdel, Enkelejda, Kasneci

TL;DR
This study compares the quality of feedback from large language models, teachers, and experts on student experimentation protocols, finding LLMs perform comparably overall but need improvement in error explanation and contextual understanding.
Contribution
It provides a systematic evaluation of LLM-generated feedback against human experts, highlighting strengths and limitations in educational feedback quality.
Findings
LLMs match teachers and experts in overall feedback quality
LLMs underperform in error identification and explanation
Combining LLMs with human input can improve educational feedback
Abstract
Effective feedback is essential for fostering students' success in scientific inquiry. With advancements in artificial intelligence, large language models (LLMs) offer new possibilities for delivering instant and adaptive feedback. However, this feedback often lacks the pedagogical validation provided by real-world practitioners. To address this limitation, our study evaluates and compares the feedback quality of LLM agents with that of human teachers and science education experts on student-written experimentation protocols. Four blinded raters, all professionals in scientific inquiry and science education, evaluated the feedback texts generated by 1) the LLM agent, 2) the teachers and 3) the science education experts using a five-point Likert scale based on six criteria of effective feedback: Feed Up, Feed Back, Feed Forward, Constructive Tone, Linguistic Clarity, and Technical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics
