Large Language Models As MOOCs Graders
Shahriar Golchin, Nikhil Garuda, Christopher Impey, Matthew Wenger

TL;DR
This study investigates the use of large language models like GPT-4 and GPT-3.5 to automate grading in MOOCs, showing promising results especially with well-defined rubrics and specific prompting techniques.
Contribution
It demonstrates the feasibility of using LLMs with zero-shot-CoT prompting to improve grading accuracy in MOOCs, advancing automated assessment methods.
Findings
Zero-shot-CoT with instructor-provided answers improves grading alignment.
GPT-4 outperforms GPT-3.5 in grading accuracy.
Course subject complexity affects grading performance.
Abstract
Massive open online courses (MOOCs) unlock the doors to free education for anyone around the globe with access to a computer and the internet. Despite this democratization of learning, the massive enrollment in these courses means it is almost impossible for one instructor to assess every student's writing assignment. As a result, peer grading, often guided by a straightforward rubric, is the method of choice. While convenient, peer grading often falls short in terms of reliability and validity. In this study, using 18 distinct settings, we explore the feasibility of leveraging large language models (LLMs) to replace peer grading in MOOCs. Specifically, we focus on two state-of-the-art LLMs: GPT-4 and GPT-3.5, across three distinct courses: Introductory Astronomy, Astrobiology, and the History and Philosophy of Astronomy. To instruct LLMs, we use three different prompts based on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Position-Wise Feed-Forward Layer · Label Smoothing · Cosine Annealing · Absolute Position Encodings · Linear Layer · Byte Pair Encoding · Multi-Head Attention
