LLM-based Automated Grading with Human-in-the-Loop
Yucheng Chu, Hang Li, Kaiqi Yang, Yasemin Copur-Gencturk, Jiliang Tang

TL;DR
This paper introduces GradeHITL, a human-in-the-loop framework leveraging large language models to improve automatic short answer grading accuracy by dynamically incorporating human expertise, surpassing existing automated methods.
Contribution
The paper presents a novel LLM-based human-in-the-loop approach for automated grading that adaptively refines rubrics with human input, achieving higher accuracy than prior fully automated methods.
Findings
GradeHITL outperforms existing automated grading methods.
Incorporating human feedback improves grading accuracy.
The approach approaches human-level grading performance.
Abstract
The rise of artificial intelligence (AI) technologies, particularly large language models (LLMs), has brought significant advancements to the field of education. Among various applications, automatic short answer grading (ASAG), which focuses on evaluating open-ended textual responses, has seen remarkable progress with the introduction of LLMs. These models not only enhance grading performance compared to traditional ASAG approaches but also move beyond simple comparisons with predefined "golden" answers, enabling more sophisticated grading scenarios, such as rubric-based evaluation. However, existing LLM-powered methods still face challenges in achieving human-level grading performance in rubric-based assessments due to their reliance on fully automated approaches. In this work, we explore the potential of LLMs in ASAG tasks by leveraging their interactive capabilities through a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Intelligent Tutoring Systems and Adaptive Learning · Multimodal Machine Learning Applications
