Using Large Language Models to Assign Partial Credit to Students' Explanations of Problem-Solving Process: Grade at Human Level Accuracy with Grading Confidence Index and Personalized Student-facing Feedback
Zhongzhou Chen, Tong Wan

TL;DR
This paper demonstrates that GPT-4o can reliably assign partial credit to student explanations in physics problems, matching human accuracy, providing confidence indices, and generating personalized feedback at a low cost, thus enabling scalable automated grading.
Contribution
The study introduces a novel method using GPT-4o for partial credit grading without reference answers, achieving human-level accuracy and providing confidence measures and feedback generation.
Findings
GPT-4o agrees with human graders 70-80% of the time.
Grading confidence index helps identify potentially incorrect grades.
Automated grading costs approximately $5 per 100 answers.
Abstract
This study examines the feasibility and potential advantages of using large language models, in particular GPT-4o, to perform partial credit grading of large numbers of student written responses to introductory level physics problems. Students were instructed to write down verbal explanations of their reasoning process when solving one conceptual and two numerical calculation problems on in class exams. The explanations were then graded according to a 3-item rubric with each item grades as binary (1 or 0). We first demonstrate that machine grading using GPT-4o with no examples nor reference answer can reliably agree with human graders on 70%-80% of all cases, which is equal to or higher than the level at which two human graders agree with each other. Two methods are essential for achieving this level of accuracy: 1. Adding explanation language to each rubric item that targets the errors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducation and Critical Thinking Development
