Code Generation Based Grading: Evaluating an Auto-grading Mechanism for "Explain-in-Plain-English" Questions
David H. Smith IV, Craig Zilles

TL;DR
This paper explores a novel auto-grading approach for 'Explain-in-Plain-English' questions in programming courses, using code generation from student responses and test cases to evaluate correctness, aiming to reduce manual grading effort.
Contribution
It introduces 'Code Generation Based Grading' (CGBG), a new method leveraging large language models to automatically assess student explanations by generating code and verifying it with test cases.
Findings
CGBG achieves moderate agreement with human graders.
The primary disagreement is due to CGBG's leniency on detailed code descriptions.
CGBG can potentially streamline grading in programming education.
Abstract
Comprehending and elucidating the purpose of code is often cited as being a key learning objective within introductory programming courses. To address this objective ``Explain-in-Plain-English'' questions, in which students are shown a segment of code and asked to provide an abstract description of the code's purpose, have been adopted. However, given EiPE questions require a natural language response, they often require manual grading which is time-consuming for course staff and delays feedback for students. With the advent of large language models (LLMs) capable of generating code, responses to EiPE questions can be used to generate code segments, the correctness of which can then be easily verified using test cases. We refer to this approach as "Code Generation Based Grading" (CGBG) and in this paper we explore its agreement with human graders using EiPE responses from past exams in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Teaching and Learning Programming · Topic Modeling
