Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course
Pavlin G. Poli\v{c}ar, Martin \v{S}pendl, Toma\v{z} Curk, Bla\v{z}, Zupan

TL;DR
This study evaluates the effectiveness of large language models in providing personalized assignment feedback in a bioinformatics course, demonstrating comparable accuracy to human graders and highlighting open-source models as a viable, privacy-preserving alternative.
Contribution
It offers a practical evaluation of LLM-based grading in a real classroom, comparing multiple models and showing their potential to replace or supplement human feedback.
Findings
LLMs can match human grading accuracy with proper prompts.
Open-source LLMs perform as well as commercial ones.
Students rated LLM-generated feedback as high quality.
Abstract
Providing students with individualized feedback through assignments is a cornerstone of education that supports their learning and development. Studies have shown that timely, high-quality feedback plays a critical role in improving learning outcomes. However, providing personalized feedback on a large scale in classes with large numbers of students is often impractical due to the significant time and effort required. Recent advances in natural language processing and large language models (LLMs) offer a promising solution by enabling the efficient delivery of personalized feedback. These technologies can reduce the workload of course staff while improving student satisfaction and learning outcomes. Their successful implementation, however, requires thorough evaluation and validation in real classrooms. We present the results of a practical evaluation of LLM-based graders for written…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning
